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Description 

Background of the invention 

5 [0001] The pharmaceutical, the fine chemicals and the food industry need a number of compounds that have to be 
isolated from complex mixtures such as extracts of animal or plant tissue, or fermentation broth. Often these isolation 
processes determine the price of the product. 

[0002] Conventional isolation processes are not very specific and during the isolation processes the compound to 
be isolated is diluted considerably with the consequence that expensive steps for removing water or other solvents 
10 have to be applied. 

[0003] For the isolation of some specific compounds affinity techniques are used. The advantage of these techniques 
is that the compounds bind very specifically to a certain ligand. However these ligands are quite often very expensive. 
[0004] To avoid spillage of these expensive ligands they can be linked to an insoluble support. However, often linking 
the ligand is also expensive and, moreover, the functionality of the ligand is often affected negatively by such procedure. 
15 [0005] So a need exists for developing cheap processes for preparing highly effective immobilized ligands. 

[0006] The non-prepublished co-pending international patent application WO 94/01567 (UNILEVER) discloses a 
method for immobilizing an enzyme or functional part thereof on the cell wall of a lower eukaryote, whereby the enzyme 
or functional part thereof is localized at the exterior of the cell wall. 

[0007] WO 92/20805 discloses the production of heterologous proteins in Gram-positive bacteria wherein the het- 
20 erologous proteins are bound to the surface of the bacteria by a cell wall spanning and membrane anchoring amino 
acid sequence. 

[0008] J.E. Francisco et al, Proc. Natl. Acad. Sci. USA 89 (1992) 2713-2717, disclose a fusion protein obtained by 
the expression of a chimeric gene consisting of (i) the signal sequence and first nine N-terminal amino acids of the 
mature major E. coli lipoprotein, (ii) amino acids 46-159 of the outer membrane OmpA, and (iii) the complete mature 
25 p-lactamase sequence. 

[0009] The fusion protein had an enzymatically active p-lactamase and was found predominantly in the outer mem- 
brane. A substantial fraction (20-30%) of the p-lactamase domain of the protein was exposed on the external surface 
of E. coli. 

[0010] In J. Bacterid. Yn_ (1989) 4569-4576, the anchoring of p-galactosidase to the E. coli inner plasma membrane 
30 is disclosed. It is suggested that by combining a signal sequence at the N-terminus with the hydrophobic domain at 
the N-terminus, many proteins, although not p-galactosidase, could become associated with the outer aspect of the 
inner membrane or with the outer membrane. Such modified bacteria might have potential use as immobilized enzyme 
systems. 

[0011] S.W. Hiebert et al, J. Cell Biology 107 (1988) 865-876, describe that the soluble cytoplasmic protein pyruvate 
35 kinase (PK) has been expressed at the cell surface of CV1 cells in a membrane-anchored form (APK). Truncated forms 

of the APK molecule, with up to 80% of the PK region of APK removed, can also be expressed at the cell surface. 

[0012] In WO 89/07140 a tripartite DNA sequence is described that is so fused that the gene will upon expression 

in a eukaryotic cell (exemplified are COS cells) give rise to a correctly processed, appropriately folded, membrane 

bound version of the protein. Various anchor domains are mentioned. 
40 [0013] C.C. Chen et al, J. Biol. Chem. 265 (1990) 3161-3167, describe that the predicted amino acid sequence 

reveals the presence of a proline-rich cell wall anchor region similar to that of other Gram-positive surface proteins. 

[0014] Vijaya et al, Mol. Cell. Biol. 8 (1988) 1709-1714, describe the cell surface localization of hybrid proteins on 

the outer surface of CV-1 cells infected by recombinant viruses. 

45 Summary of the invention 

[0015] The invention is a process for carrying out an isolation process by using an immobilized binding protein or 
functional part thereof still capable of binding to a specific compound, wherein a medium containing said specific com- 
pound is contacted with a host cell according to the invention under conditions whereby a complex between said specific 
50 compound and said immobilized binding protein is formed, separating said complex from the medium originally con- 
taining said specific compound and, optionally, releasing said specific compound from said binding protein or functional 
part thereof. 

[0016] Preferably, the host is selected from Gram-positive bacteria and fungi, which have a cell wall at the outside 
of the host cell, in contrast to Gram-negative bacteria and cells of higher eukaryotes such as animal cells and plant 
55 cells, which have a membrane at the outside of their cells. Suitable Gram-positive bacteria comprise lactic acid bacteria 
and bacteria belonging to the genera Bacillus and Streptomyces. Suitable fungi comprise yeasts belonging to the 
genera Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia and Saccharomyces, and moulds belonging to 
the genera Aspergillus, Penicillium and Rhizopus. In this specification the group of fungi comprises the group of yeasts 
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and the group of moulds, which are also known as lower eukaryotes. In contrast to the cells in plants and animals the 
group of bactena and lower eukaryotes are also indicated in this specification as microorganisms 
[0017] Recombinant polynucleotides capable of being used in a method as described above, are polynucleotides 
comprising (.) a structural gene encoding a binding protein or a functional part thereof still having the specific binding 
capability, and (n) at least part of a gene encoding an anchoring protein capable of anchoring in the cell wall of a Gram- 
positive bacterium or a fungus, said part of a gene encoding at least the anchoring part of said anchoring protein which 
anchoring part is derivable from the C-terminal part of said anchoring protein. 

[0018] The anchoring protein can be selected from cc-agglutinin, cc-agglutinin, FL01, the Major Cell Wall Protein of 
a lower eukaryote, and proteinase of lactic acid bacteria. Preferably, such polynucleotide further comprises a nucleotide 
sequence encoding a signal peptide ensuring secretion of the expression product of the polynucleotide which signal 
peptide can be derived from a protein selected from the a-mating factor of yeast, cc-agglutinin of yeast, invertase of 
Saccharomyces, inulinase of Kluyveromyces, cc-amylase of Bacillus, and proteinase of lactic acid bacteria The poly- 
nucleotide can be operably linked to a promoter, which is preferably an inducible promoter. 

'5 Brief description of the figures 

[0019] 

In Figure 1 the composition of pEMBL9-derived plasmid pUR4122 is indicated, the preparation of which is de- 
20 scribed in Example 1. 

In Figure 2 the composition of plasmid pUR2741 is indicated, which is a derivative of published plasmid pUR2740 
see Example 1. 

In Figure 3 the composition of pEMBL9-derived plasmid pUR2968 is indicated. Its preparation is described in 
Example 1 . 

25 In Figure 4 the preparation of plasmid pUR4174 starting from plasmids pUR2741, pUR2968 and pUR4122 is 

indicated, as well as the preparation of plasmid pUR4175 starting from plasmids pSY16, pUR2968 and pUR4122 
These preparations are described in Example 1 . 

In Figure 5 the composition of plasmid pUR2743.4 is indicated. Its preparation is described in Example 2 It contains 
the 714 bp Pst\-Xho\ fragment given in SEQ ID NO: 12, which fragment encodes an scFv-TRAS fragment of anti- 
3 o traseolide® antibody 02/01/01 . 

In Figure 6 the composition of plasmid pUR4178 is indicated. Its preparation is indicated in Example 2 It contains 
the above mentioned 714 bp Pst\-Xho\ fragment given in SEQ ID NO: 1 2. This plasmid is suitable for the expression 
of a fusion protein between scFv-TRAS and ocAGG preceded by the invertase signal sequence (SUC2). 
In Figure 7 the composition of plasmid pUR41 79 is indicated. Its preparation is indicated in Example 2 It contains 
the above mentioned 714 bp Pst\-Xho\ fragment given in SEQ ID NO: 12. This plasmid is suitable for the expression 
of a fusion protein between scFv-TRAS and oAGG preceded by the prepro-a-mating factor signal signal sequence 
In Figure 8 a molecular design picture is given, showing the musk odour molecule traseolide® and a modified 
musk antigen, described in Example 3. 

in Figure 9 the composition of plasmid pUR4177 is indicated. Its construction is described in Example 4 Plasmid 
PUR4177 contains the 734 bp Eag\-Xho\ DNA fragment given in SEQ ID NO: 13 encoding the variable regions of 
the heavy and light chain fragments from the monoclonal antibody directed against the human chorionic gonado- 
tropin (an scFv-HCG fragment) and is a 2 um-based vector suitable for production of the chimeric scFv HCG- 
otAGG fusion protein preceded by the invertase signal sequence and under the control of the GAL7 promoter. 
In Figure 1 0 the composition of plasmid pUR41 80 is indicated. Its preparation is indicated in Example 4 It contains 
the above mentioned 734 bp Eagi-Xho\ DNA fragment given in SEQ ID NO: 1 3 and is a 2 um-based vector suitable 
for production of the chimeric scFv-HCG-aAGG fusion protein preceded by the prepro-a-mating factor signal se- 
quence and under the control of the GAL7 promoter. 

In Figure 11 the composition of plasmid pUR2990, a 2 ^m-based vector, is indicated, which is suggested in Ex- 
ample 5 as a starting vector for the preparation of plasmid pUR4196 (see Figure 12). Plasmid pUR2990 contains 
a DNA fragment encoding a chimeric lipase-FL0 1 protein that will be anchored in the cell wall of a lower eukaryote 
and can catalyze lipid hydrolysis. 
In Figure 1 2 the composition of plasmid pUR41 96 is indicated. Its preparation is explained in Example 5. It contains 
a DNA fragment encoding a chimeric protein comprising the scFv-HCG followed by the C-terminal part of the 
FLOl -protein, and is a vector suitable for the production of a chimeric protein anchored in the cell wall of the host 
55 organism and can bind HCG. 

In Figure 1 3 the composition of plasmid pUR2985 is indicated. Its preparation is described in Example 6 It contains 
a choB gene coding for the mature part of the cholesterol oxidase (EC 1 .1 .3.6) obtained via PCR techniques from 
the chromosome of Brevibactehum sterolicum. 
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In Figure 14 the composition of plasmid pUR2987 is indicated. Its preparation from plasmid pUR2985 is described 
in Example 6. It contains a DNA sequence comprising the choB gene coding for the mature part of the cholesterol 
oxidase preceded by DNA encoding the prepro-a-mating factor signal sequence and followed by DNA encoding 
the C-terminal part of a-agglutinin. 
5 In Figure 15 the composition of the published plasmid pGKV550 is indicated. It is described in Example 7 and 

contains the complete cell wall proteinase operon of Lactococcus lactis subsp. cremoris Wg2, including the pro- 
moter, the ribosome binding site and the prtP gene. 

In Figure 16 the composition of plasmid pUR2988 is indicated. Its preparation is described in Example 7. It is 
anticipated that this plasmid can be used for preparing a further plasmid pUR2989, which after introduction in a 
10 lactic acid bacterium will be responsible for producing a chimeric protein that will be anchored at the outer surface 

of the lactic acid bacterium and is capable of binding cholesterol. 

In Figure 17 the composition of plasmid pUR2993 is indicated. Its preparation is described in Example 8. It is 
anticipated that this plasmid can be used for transforming yeast cells that can bind a human epidermal growth 
factor (EGF) through an anchored chimeric protein containing an EGF receptor. 
15 in Figure 1 8 the composition of plasmids pUR4482 and 4483 is indicated. Their preparation is described in Example 

9. Plasmid pUR4482 is a yeast episoma! expression plasmid for expression of a fusion protein with the invertase 
signal sequence, the CH V 09 variable region, the Myc-tail, and the M X-P-X-P H Hinge region of a camel antibody, 
and the a-agglutinin cell wall anchor region. Plasmid pUR4483 differs from pUR4482 in that it does not contain 
the "X-P-X-P" Hinge region. 

20 In Figure 19 immunofluorescent labelling (anti-Myc antibody) of SU10 cells in the exponential phase (OD 530 = 

0.5) expressing the genes of camel antibodies present on plasmids pUR4424, pUR4482 and pUR4483 is shown. 
Ph = phase contrast, Fl = fluorescence. 

In Figure 20 immunofluorescent labelling (anti-human IgG antibody) of SU10 cells in the exponential phase 
(OD 530 =0.5) expressing the genes of camel antibodies present on plasmids pUR4424, pUR4482 and pUR4483 is 
25 shown. 

Ph = phase contrast, Fl = fluorescence. 



Abbreviations used in the Figures: 



30 [0020] 





a-gal: 


gene encoding guar cc-galactosidase 




AG-alpha-1/AGa1: 


gene expressing a-agglutinin from S. cerevisiae 




AGa1 cds/a-AGG: 


coding sequence of a-agglutinin 


35 


Amp/amp r: 


p-lactamase resistance gene 




CHv09: 


camel heavy chain variable 09 fragment 




EmR: 


erythromycin resistance gene 




fl: 


phage f1 replication sequence 




FL01/FLO (C-part): 


C-terminal part of FLOl coding sequence of flocculation protein 


40 


Hinge: 


Camel "X-P-X-P" Hinge region, see Example 9 




LEU2: 


LEU2 gene 




LEU2d/Leu2d: 


truncated LEU2 gene 




Leu 2d cs: 


coding sequence LEU2d gene 




MycT: 


camel Myc-tail 


45 


Ori MB1: 


origin of replication MB1 derived from 5. coli plasmid 




Pga17/pGAL7: 


GAL7 promoter 




Tpgk: 


terminator of the phosphoglyceratekinase gene 




ppa-MF/MFa1ss: 


prepro-part of a-mating factor (= signal sequence) 




repA: 


gene encoding the repA protein required for replication (Fig. 15/16). 


50 


ScFv (Vh-VI): 


single chain antibody fragment containing V H and V L chains 




ss: 


signal sequence 




SUC2: 


invertase signal sequence 




2u/2 micron: 


2nm sequence 



55 Detailed description of the invention 

[0021] The present invention relates to the isolation of valuable compounds from complex mixtures by making use 
of immobilized ligands. The immobilized ligands can be proteins obtainable via genetic engineering and can consist 
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of two parts, namely both an anchoring protein or functional part thereof and a binding protein or functional part thereof. 
[0022] The anchoring protein sticks into cell walls of microorganisms, preferably lower eukaryotes, e.g. yeasts and 
moulds. Often this type of proteins has a long C-terminal part that anchors it in the cell wall. These C-terminal parts 
have very special amino acid sequences. A typical example is anchoring via C-terminal sequences of proteins enriched 
5 in proline, see Kok (1990). 

[0023] The C-terminal part of these anchoring proteins can contain a substantial number of potential serine and 
threonine glycosylation sites. O-glycosylation of these sites gives a rod-like conformation to the C-terminal part of these 
proteins. 

In the case of anchored manno-proteins they seem to be linked to the gtucan in the cell wall of lower eukaryotes, as 

10 they cannot be extracted from the cell wall with sodium dodecyi sulphate (SOS), but can be liberated by glucanase 
treatment, see our co-pending patent application WO-94/01 567 (UNILEVER) published 20 January 1 994 and Schreud- 
er c.s. (1993), both being published after the claimed priority date. WO-A-94/01567 discloses the use of anchored 
enzymes in an enzymatic process. Another mechanism to anchor proteins at the outer side of a cell is to make use of 
the property that a protein containing a glycosyl-phosphatidyl-inosito! (GPI) group anchors via this GPI group to the 

15 cell surface, see Conzelmann c.s. (1990). 

[0024] The binding protein is so called, because it ligates or binds to the specific compound to be isolated. If the N- 
terminal part of the anchoring protein is sufficiently capable of binding to a specific compound, the anchoring protein 
itself can be used in a process for isolating that specific compound. Suitable examples of a binding protein comprise 
an antibody, an antibody fragment, a combination of antibody fragments, a receptor protein, an inactivated enzyme 

20 still capable of binding the corresponding substrate, and a peptide obtained via Applied Molecular Evolution, see Lewin 
(1990), as well as a part of any of these proteinaceous substances still capable of binding to the specific compound 
to be isolated. Ail these binding proteins are characterized by specific recognition of the compounds or group of related 
compounds to be isolated. The binding rate and release rate, and therefore the binding constant between the specific 
compound to be isolated and the binding protein, can be regulated either by changing the composition of the liquid 

25 extract in which the compound is present or, preferably, by changing the binding protein by protein engineering. 

[0025} The gene coding for the chimeric protein comprising both the binding protein and the anchoring protein (or 
functional parts thereof) can be placed under control of a constitutive, inducible or derepressible promoter and will 
generally be preceded by a DNA fragment encoding a signal sequence ensuring efficient secretion of the chimeric 
protein. Upon secretion the chimeric protein will be anchored in the cell wall of the microorganisms, thereby covering 

30 the surface of the microorganisms with the chimeric protein. These microorganisms can be obtained in normal fermen- 
tation processes and their isolation is a cheap process, when physical separation processes are used, e.g. centrifu- 
gation or membrane filtration. 

[0026] After washing, the isolated microorganisms can be added to liquid extracts containing the valuable specific 
compound or compounds. After some time the equilibrium between the bound and free specific compound(s) will be 

35 reached and the microorganisms to which the specific compound or group of related compounds is bound can be 
separated from the extract by simple physical techniques. Alternatively, the microorganisms covered with ligands can 
be brought on a support material and subsequently this coated support material can be used in a column. 
[0027] The liquid extract containing the specific compound or compounds of interest can be added to the column 
and afterwards the compound(s) can be released from the ligand by changing the composition of the eluting liquid or 

46 the temperature or both. A skilled person will recognize that in addition to these two possibilities other modifications 
can be used for effecting the binding of the specific compound and the ligand, their subsequent isolation and/or the 
release of the specific compound(s). 

[0028] In particular the invention relates to chimeric proteins that are bound to the cell wall of lower eukaryotes. 
Suitable lower eukaryotes comprise yeasts, e.g. Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia and 
45 Saccharomyces, and moulds e.g. Aspergillus, Penicillium and Rhizopus. For some applications prokaryotes are also 
applicable, especially Gram-positive bacteria, examples of which include lactic acid bacteria, and bacteria belonging 
to the genera Bacillus and Streptomyces. 

[0029] For lower eukaryotes the present invention provides genes encoding chimeric proteins consisting of: 

50 a. a DNA sequence encoding a signal sequence functional in a lower eukaryotic host, e.g. derived from a yeast 

protein including the a-mating factor, in vertase, ct-agglutinin, inulinase or derived from a mould protein e.g. xyla- 
nase; 

b. a structural gene encoding a C-terminal part of a cell wall protein preceded by a structural gene encoding a 
protein, that is capable of binding to the specific compound or group of compounds of interest, examples of which 
55 include 

an antibody, 

- a single chain antibody fragment (scFv; see Bird and Webb Walker (1991), 
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- a variable region of the heavy chain (V H ) or a variable region of the light chain (V L ) of an antibody or that part 
of such variable region still containing one to three of the complementarity determining regions (CDRs), 

- an agonist-recognizing part of a receptor protein or a part thereof still capable of binding the agonist, 

- a catalytically inactivated enzyme, or a fragment of such enzyme still containing a substrate binding site of the 
enzyme, 

- specific lipid binding proteins or parts of these proteins still containing the lipid binding site(s), see Ossendorp 
(1992), and 

- a peptide that has been obtained via Applied Molecular Evolution, see Lewin (1990). 

[0030] All expression products of these genes are characterized in that they consists of a signal sequence and both 
a protein part, that is capable of binding to the compound(s) to be isolated, and a C-terminus of a typically cell wall 
bound protein, examples of the latter including a-agglutinin, see Lipke c.s. (1989), a-agglutinin, see Roy c.s. (1991), 
FL01 (see Example 5 and SEQ ID NO: 14) and the Major Cell Wall Protein of lower eukaryotes, which C-terminus is 
capable of anchoring the expression product in the cell wall of the lower eukaryote host organism. 
[0031] The expression of these genes encoding chimeric proteins can be under control of a constitutive promoter, 
but an inducible promoter is preferred, suitable examples of which include the GAL7 promoter from Saccharomyces, 
the inulinase promoter from Kluyveromyces, the methanol-oxidase promoter from Hansenuia, and the xylanase pro- 
moter of Aspergillus. Preferably the constructs are made in such a way that the new genetic information is integrated 
in a stable way in the chromosome of the host cell, see e.g. WO-91/00920 (UNILEVER). 

[0032] The lower eukaryotes transformed with the above mentioned genes can be grown in normal fermentation, 
continous fermentation, or fed batch fermentation processes. 

[0033] The selection of a suitable process for growing the microorganism will depend on the construction of the gene 
and the promoter used, and on the desired purity of the cells after the physical separation procedure(s). 
[0034] For bacteria the present invention deals with genes encoding chimeric proteins consisting of: 

a. a DNA sequence encoding a signal sequence functional in the specific bacterium, e.g. derived from a Bacillus 
a-amylase, a Bacillus subtilis subtilisin, or a Lactococcus lactis subsp. cremoris proteinase; 

b. a structural gene encoding a C-terminal part of a cell wall protein preceded by a structural gene encoding a 
protein capable of binding to the specific compound or group of compounds of interest, examples of which are 
given above for a lower eukaryote. 

[0035] All expression products of these genes are characterized in that they consist of a signal sequence and both 
a protein part, that is capable of binding to the specific compound or specific group of compounds to be isolated, and 
a C-terminus of a typically cell wall-bound protein such as the proteinase of Lactococcus lactis subsp. cremons strain 
Wg2, see Kok c.s. (1988) and Kok (1990). the C-terminus of which is capable of anchoring the expression product in 
the cell wall of the host bacterium. 

[0036] The invention is illustrated with the following Examples without being limited thereto. First the endonuclease 
restriction sites mentioned in the Examples are given. 
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BstEII G GTNACC 
CCANTG G 

EcoRl G AATTC 
CTTAA G 



Clal 



NotI 
Sad 



GAGCT C 
C TCGAG 



AT CGAT 
TAGC TA 



ifindlll A AGCTT 
TTCGA A 



GC GGCCGC Nrul 
CGCCGG CG 



Sail 



TCG CGA 
AGC GCT 

G TCGAC 
CAGCT G 



EagI 
Nhel 
PStI 

Xhol 



C GGCCG 
GCCGG C 

G CTAGC 
CGATC G 

CTGCA G 
G ACGTC 

C TCGAG 
GAGCT C 
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Example 1. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lower 
eukaryote and is able to bind with high specificity lysozyme from a complex mixture. 

[0037] Lysozyme is an anti-microbial enzyme with a number of applications in the pharmaceutical and food industries. 

5 Several sources of lysozyme are known, e.g. egg yolk or a fermentation broth containing a microorganism producing 
lysozyme. Monoclonal antibodies have been raised against lysozyme, see Ward c.s. (1 989), and the mRNA's encoding 
the light and heavy chains of such antibodies have been isolated from the hybridoma cells and used as template for 
the synthesis of cDNA using reverse transcriptase. Starting from the plasmids as described by Ward c.s. (1989), we 
constructed a pEMBL-derived plasmid, designated pUR4122 t in which the multiple cloning site of the pEMBL-vector, 

10 ranging from the EcoRI to the H/ndlll site, was replaced by a 231 bp DNA fragment, whose nucleotide sequence is 
given in SEQ ID NO: 1 and has an EcoRI site (GAATTC) at nucleotides 1-6, a Psfl site (CTGCAG) at nucleotides 
1 05-1 1 0, a BsfEII site (GGTCACC) at nucleotides 1 22-1 28, a Xnol site (CTCGAG) at nucleotides 207-21 2, and a H/ndlll 
site (AAGCTT) at nucleotides 226-231. 

15 Construction of pUR4122 

[0038] Plasmid pEMBL9, see Dente c.s. (1983), was digested with EcoRI and H/ndlll and the resulting large fragment 
was ligated with the double stranded synthetic DNA fragment given in SEO ID NO: 1. For the successive ligation of 
DNA fragments, which finally form the coding sequence of a single chain antibody fragment for lysozyme, the following 

20 elements were combined in the 231 bp DNA fragment (SEQ ID NO: 1 ) inserted into the pEMBL-9 vector: the 3* part of 
the GAL7 promoter, the invertase signal sequence (SUC2), a Pst\ restriction site, a SsfEII restriction site, a sequence 
encoding the (GGGGS)x3 peptide linker connecting the V H and V L fragments, a Saci restriction site, a Xnol restriction 
site and a H/ndlll restriction site, resulting in plasmid pUR4119. To obtain the in frame fusion between V H and the 
GGGGS-linker plasmid pSW1-VHD1.3-VKD1.3-TAG1, see Ward c.s. (1989), was digested with Pstt and BsfEII and 

25 a DNA fragment of 0.35 kbp was ligated in the correspondingly digested pUR4119 resulting in plasmid pUR4119A. 
Subsequently the plasmid pSW1-VHD1 .3-VKD1 .3-TAG1 was digested with Sad and Xnol and this fragment containing 
the coding part of V L was finally ligated into the SacUXhol sites of pUR41 1 9A, resulting in plasmid pUR41 22 (see Figure 
1). 

30 Construction of pUR4174, see Figure 4 

[0039] To obtain S. cerevisiae episomal expression plasmids containing DNA encoding a cell wall anchor derived 
from the C-terminal part of a-agglutinin, plasmid pUR2741 (see Figure 2) was selected as starting vector. Basically, 
this plasmid is a derivative of pUR2740, which is a derivative of plasmid pUR2730 as described in WO-91/19782 
35 (UNILEVER) and by Verbakel (1991). The preparation of pUR2730 is clearly described in Example 9 of EP-A1 -0255153 
(UNILEVER). Plasmid pUR2741 differs from plasmid pUR2740 in that the Eagl restriction site within the remaining 
part of the already inactive let resistance gene was deleted through Nru\ISal\ digestion. The Sail site was filled in prior 
to religation. 

[0040] After digesting pUR4122 with Sad (partially) and H/ndlll, the approximately 800 bp fragment was isolated 
40 and cloned into the pUR2741 vector fragment, which was obtained after digestion of pUR2741 with the same enzymes. 
The resulting plasmid was named pUR4125. 

[0041] A plasmid named pUR2968 (see Figure 3) was made by (1 ) digesting with H/ndlll the Agaf-containing plasmid 
pLa21 published by Lipke c.s. (1 989), (2) isolating an about 6.1 kbp fragment and (3) ligating that fragment with H/ndlll- 
treated pEMBL9, so that the 6.1 kbp fragment was introduced into the H/ndlll site present in the multiple cloning site 
45 of the pEMBL9 vector. 

[0042] Plasmid pUR4125 was digested with Xho\ and H/ndlll and the about 8 kbp fragment was ligated with the 
approximately 1.4 kbp A/nel-H/ndlll fragment of pUR2968, using XhoUNhel adapters having the following sequence: 



so XhoX Nhel 

5 " - TC GAG ATC AAA GGC GGA TCT G -3 ' = SEQ ID NO: 2 

3'- C TAG TTT CCG CCT AGA CGATC -S ' = SEQ ID NO: 3. 

[0043] The plasmid resulting from the ligation of the appropriate parts of plasmids pUR2968, pUR4125 and Xholl 
55 Nhe\ adapters, was designated pUR4174 and encodes a chimeric fusion protein at the amino terminus consisting of 
the invertase signal (pre) peptide, followed by the scFv-LYS polypeptide and, finally, the C-terminal part of a-agglutinin 
(see Figure 4). 
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Construction of pUR4175, see Figure 4 

[0044] Upon digesting pUR4122 (see above) with Pst\ and H/ndlll, the approximately 700 bp fragment was isolated 
and iigated into a vector fragment of plasmid pSY16, see Harmsen c.s. (1993), which was digested with Eagl and 
H/ndlll and using Eag\-Pst\ adapters, having the following sequence: 

EagX Pstl 
5 , -G_GCC_SCC CAG GTG CAG CTG CA- 3 1 = SEQ ID NO: 4 

3 « - CGG GTC CAC GTC G —5 1 = SEQ ID NO: 5 

[0045] The resulting plasmid, named pUR41 32, was digested with Xho\ and H/ndlll and Iigated with the approximately 
1.4 kbp Nhe\-Hin6\\\ fragment of pUR2968 (see above), using Xho\/Nhe\ adapters as described above, resulting in 
pUR4175 (see Figure 4). This plasmid contains a gene encoding a chimeric protein consisting of the ct-mating factor 
prepro-peptide, followed by the scFv-LYS polypeptide and, finally, the C-terminal part of a-agglutinin. 

Example 2 Construction of genes encoding a series of homologous chimeric proteins that will be anchored 
in the cell wall of a lower eukaryote and are able to bind with high specificities the musk fragrance trascol.de® 
from a complex mixture. 

[0046] The isolation of RNA from the hybridoma cell lines, the preparation of cDNA and amplification of gene frag- 
ments encoding the variable regions of antibodies by PCR was performed according to standard procedures known 
from the literature, see e.g. Orlandi c.s. (1989). For the PCR amplification different oligonucleotide primers have been 
used. 

[0047] For the heavy chain fragment: 

A: AGG TSM AB fTCC AGS AGT CWG G ■ SEQ ID NO: 6 

Pstl 

in which S is C or G, M is A or C, R is A or G, and W is A or T, 
and 

B: TGA GGA GAC GOT QAC CGT GGT CCC TTG GCC CC 

BstEU = SE< -* 1D NO: 7 - 

For the light chain fragment (Kappa): 

C GAC ATT G_AG_£EC ACC CAG TCT CCA - SEQ ID NO: 8, 

Sacl 



and 



D: GTTTGATOLCGA^CTTGGTCCC - SEQ ID NO: 9. 

Xltol 

Construction of pUR4143 

[0048] To simplify future construction work an Eagl restriction site was introduced in pUR4122 (see above), at the 
junction between the invertase signal sequence and the scFv-LYS. This was achieved by replacing the about 10 bp 
EcoRI-Psfl fragment within the synthetic fragment given in SEQ ID NO: 1 by synthetic adapters with the following 
sequence: 
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EcoRl PsA 
AATTC GGCCGTTCAGGTGCAGCTGCa = SEQ ID NO: 10 

GCCGGCAAGTCCACGTCG = SEQ ID NO: 11- 



[0049] The resulting plasmid was designated pUR4 1 22. 1 : a construction vector for single chain Fv assembly in frame 
behind an Eagl site for expression behind either the prepro-a-mating factor sequence or the SUC2 invertase signal 
10 sequence. 

[0050] After digesting the heavy chain PCR fragment with Psfl and BstEU, two fragments were obtained: a Pst\ 
fragment of about 230 bp and a PstV BstEU fragment of about 110 bp. The latter fragment was cloned into vector 
pUR4122.1, which was digested with Pst\ and SsfElL The newly obtained plasmid (pUR4122.2) was digested with 
Sad and Xhol after which the light chain PCR fragment (digested with the same restriction enzymes) was cloned into 
15 the vector, resulting in pUR41223. This plasmid was digested with Psfl, after which the above described about 230 bp 
Pst\ fragment was cloned into the plasmid vector, resulting in a plasmid called pUR41 43. Two orientations are possible, 
but selection can be made by restriction analysis, as usual. Instead of the scFv-LYS gene originally present in pUR41 22, 
this new plasmid pUR4143 contains a gene encoding an scFv-TRAS fragment of anti-traseolide antibody 02/01/01 (for 
the nucleotide sequence of the 714 bp Pst\-Xho\ fragment see SEQ ID NO: 12). 



20 



55 



Construction of pUR4178 and pUR4179. 



[0051] After digesting pUR4143 with Eag\ and with H/ndlll, an about 715 bp fragment can be isolated. Subsequentely, 
this fragment can be cloned into the vector backbone fragments of pUR2741 and pUR4175, that were digested with 
25 the same restriction enzymes. In the case of pUR2741 , this resulted in plasmid pUR2743.4 (see Figure 5). This plasmid 
can subsequently be cleaved with Xhoi and H/ndlll and ligated with the about 8 kbp Xnol-H/ndlll fragment of pUR41 74, 
resulting in pUR4178 (see Figure 6). 

[0052] In the situation where pUR4175 was used as a starting vector, the resulting plasmid was designated pUR4179 
(see Figure 7). 

30 Both plasmids, pUR4178 and pUR4179 were introduced into S. cerevisiae. 

Example 3. The modification of the binding parts of the chimeric protein that can bind traseolide® in order to 
improve the binding or release of traseolide® under certain conditions. 

35 [0053] Modification of binding properties of antibodies during the immune response is a well known immunological 
phenomenon originating from the fine tuning of complementarity determining sequences in the antibody's binding region 
to the antigen's molecular properties. This phenomenon can be mimicked in vitro by adjusting the antigen binding 
regions of antibody fragments based on molecular models of these regions in contact with the antigen. 
[0054] One such example consists of protein engineering the antimusk antibody M02/01/01 to a stronger binding 

40 variant M020501L 

[0055] First, a molecular model of M02/01/01 variable fragment (Fv) was constructed by homology modelling, using 
the coordinates of the anti-lysozyme antibody HYHEL-10 as a template (Brookhaven Protein Data Bank entry: 3HFM). 
This model was refined using Molecular Mechanics and Molecular Dynamics methods from within the Biosym program 
DISCOVER, on a Silicon Graphics 4D240 workstation. Secondly, the binding site of the resulting Fv was mapped by 
45 visually docking the musk antigen into the CDR region, followed by a refinement using molecular dynamics again. 
Upon inspection of the resulting model for packing efficiency (van der Waals contact areas), it was concluded that 
substitution of ALA H96 by VAL would increase the (hydrophobic) contact area between the ligand and Fv, and con- 
sequently lead to a stronger interaction (see Figure 8). 

[0056] When this mutation is introduced into M02/01/01 , the cDNA-derived scFv from Example 2, the result will be 
so Fv M020501 i; a variant with an increased affinity of at least a factor of 5 can be expected, and the increased affinity 
could be measured using fluorescence titration of the Fv with the musk odour molecule. 

Example 4. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of lower 
eukaryote and is able to bind hormones such as HCG. 

[0057] Gene fragments, encoding the variable regions of the heavy and light chain fragments from the monoclonal 
antibody directed against the human chorionic gonadotropin were obtained from a hybridoma cell line in a similar way 
as described in Example 2. 
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[0058] Subsequently, these HCG and gene fragments were cloned into plasmid pUR4143 by replacing the 
corresponding Psfl-SsrEII and Sacl-Xnol gene fragments, resulting in plasmid pl)R4146. 

[0059] Similar to the method described in Example 2, the 734 bp Eag\-Xho\ fragment (nucleotide sequence given in 
SEQ ID NO: 13) encoding the variable regions of the heavy and light chain fragments from the monoclonal antibody 
5 directed against the human chorionic gonadotropin (an scFv-HCG fragment) was isolated from pUR4146 and was 
introduced into the vector backbone fragment of pUR4178 (see Example 2) and will be introduced into the vector 
backbone fragment of pUR4175 (see Example 1), both digested with the same restriction enzymes. The resulting 
plasmids pUR4177 (see Figure 9) was, and pUR4180 (see Figure 10) will be, introduced into S. cerevisiae strain SU10. 

10 Example 5. Construction of a gene encoding a chimeric scFv-FL01 protein that will be anchored in the cell 
wall of lower eukaryote and is able to bind hormones such as HCG. 

[0060] One of the genes associated with the flocculation phenotype in S. cerevisiae is the FL01 gene. The DNA 
sequence of a clone containing major parts of the FL01 gene has been determined, see SEQ ID NO: 14 giving 2685 

15 bp of the FLOl gene. The cloned fragment appeared to be approximately 2 kb shorter than the genomic copy as judged 
from Southern and Northern hybridizations, but encloses both ends of the FL01 gene. Analysis of the DNA sequence 
data indicates that the putative protein contains at the N-terminus a hydrophobic region which confirms a signal se- 
quence for secretion, a hydrophobic C-terminus that might function as a signal for the attachment of a GPt-anchor and 
many glycosylation sites, especially in the C-terminus, with 46.6% serine and threonine in the arbitrarily defined C- 

20 terminus (aa 271-894). Hence, it is likely that the FL01 gene product is located in an orientated fashion in the yeast 
cell wall and may be directly involved in the process of interaction with neighbouring cells. 

[0061] The cloned FL01 sequence might therefore be suitable for the immobilization of proteins or peptides on the 
cell surface by a different type of cell wall anchor. 

[0062] For the production of a chimeric protein comprising the scFv-HCG followed by the C-terminal part of the 
25 FL01 -protein, plasmid pUR2990 (see Figure 1 1 ) can be used as a starting vector. The preparation of episomal plasmid 
pUR2990 was described in our co-pending patent application WO-94/01567 (UNILEVER) published on 20 January 
1994, i.e. during the priority year. Plasmid pUR2990 comprises the chimeric gene consisting of the gene encoding the 
Humicola lipase and a gene encoding the putative C-terminal cell wall anchor domain of the FL01 gene product, the 
chimeric gene being preceded by the invertase signal sequence (SUC2) and the GAL7 promoter; further the plasmid 
30 comprises the yeast 2 jim sequence, the defective Leu2 promoter described by Eckard and Hollenberg (1983), and 
the Leu2 gene, see Roy c.s. (1991). Plasmid pUR4146, described in Example 4, can be digested with Pstt and Xho\, 
and the about 0.7 kbp Pst\-Xho\ fragment containing the scFv-HCG coding sequence can be isolated. For the in frame 
fusion of this DNA sequence between the C-terminal FL01 part and the SUC2 signal sequence, the fragment can be 
directly ligated with the 9,3 kbp Eag\/Nhe\ (partial) backbone of plasmid pUR2990, resulting in plasmid pUR41 96 (see 
35 Figure 12). This plasmid will comprise an additional triplet encoding Ala at the transition between the SUC2 signal 
sequence and the start of the scFv-HCG, and a E-l-K-G-G amino acid sequence in front of the first amino acid (Ser) 
of the C part of FL01 protein. 

[0063] If in the previous Examples 1-5 the level of exposed antibody fragments is too low, the production level can 
he increased by mutagenesis of the frame work regions of the antibody fragment. This can he done in a site directed 
40 way or by (targeted) random mutagenesis, using techniques described in the literature. 

Example 6. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lower 
eukaryote and is able to bind cholesterol. 

45 [0064] In the literature two DNA sequences for cholesterol oxidase are described, the choB gene from Brevibacterium 
sterolicum, see Ohta c.s. (1991) and the choA gene from Streptomyces $p. SA-COO, see Ishizaka c.s. (1989). For 
the construction of a DNA fusion between the choB gene coding for cholesterol oxidase (EC 1.1.3.6) and the 3' part 
of the AG-a1 gene, the PCR technique on chromosomal DNA can be applied. Chromosomal DNA can be isolated by 
standard techniques from Brevibacterium sterolicum, and the DNA part coding for the mature part of the cholesterol 

50 oxidase can he amplified through application with the following corresponding PCR primers choOlpcr and cho02pcr: 



10 



EP0 682 710 B1 



10 



choQlpcr 

5 .. gcc ccc agc ccc acc ere g-3* = SEQ ID NO: 16 

3 ._ CGG GGG TOG GCC TGG GAG C-5 ' = SEQ ID NOt 17 

!!S ill Ml IM II! !t! \ «~ t~ w% ,o 

S « -AGATCT GAATTCCCCCCC GCC CCC AGC CCC ACC CTC G-3' = SEQ ID NO! 18 
ECORI tfOtl 
EagI 



cho02pcr 

Nhel Hin&ZXl 

3 • -TAG TAG AGC AGG CTG TAG GTC CGATCGACT2ICGMTCTAGA-5 ■ = SEQ ID NO: 19 

Ml I M I M Ml MI Ml MS 
75 $*-ATC ATC TCG TCC GAC ATC CAG-3* " SEQ ID NO: 20 

3 ■ -TAG TAG AGC AGG CTG TAG GTC-5 ' 85 SEQ ID NO: 21 



[0065] Both primers can specifically hybridize with the target sequence, thereby amplifying the coding part of the 
20 gene in such a way, that the specific PCR product -after Proteinase K treatment and digestion with EcoRI and H/ndlll- 
can be directly cloned into a suitable vector, here preferably pTZ19R, see Mead c.s. (1986). This will result in plasmid 
pUR2985 (see Figure 13). 

[0066] In addition to the already mentioned restriction sites both PCR primers generate other restriction sites at the 
5' end and the 3' end of the 1 .5 kbp DNA fragment, which can be used later on to fuse the fragment in frame between 

25 either the SUC2 signal sequence or the prepro-a-mating factor signal sequence on one side and the C-terminus coding 
part of the ot-agglutinin gene on the other side. To facilitate the ligation behind the prepro-MF sequence a Nofl site is 
introduced at the 5' end of PCR oligonucleotide cho01 per, allowing for example, the exchange of the 731 bp Eag\INhe\ 
fragment containing the scFv-Lys coding sequence in pUR4175 for the choB coding sequence. 
[0067] To create an enzymatically inactive fusion protein between cholesterol oxidase and cc-agglutinin, the above 

30 described subcloning into pTZ1 9R can be used. Cholesterol oxidase is an FAD-dependent enzyme for which the crystal 
structure of the Brevibacterum sterolicum enzyme has been determined, see Vrielink c.s. (1991). The enzyme displays 
homology with the typical pattern of the FAD-binding domain with the Gly-X-Gly-X-X-Gly sequence near the N-terminus 
(amino acid 1 8-23). Site-directed in vitro mutagenesis on the plasmid pUR2985 according to the manufacturer's protocol 
(Muta-Gene kit, Bio-Rad) can be applied to inactivate the FAD-binding site through replacing the triplet(s) encoding 

35 the Gly residue(s) by triplets encoding other amino acids, thereby presumably inactivating the enzyme. E.g. the fol- 
lowing primer can be used for site-directed mutagenesis of 2 of the conserved Gly residues. 



40 



45 



3*- CGG GAG CAG TAG CGG TCA CGT ATG CCG CCA CGG CAG CGG CGC -5* 

Ml Mi Mt MS i I HI II Hi Ml IM III IM Ml Ml m 

5'- GCC CTC CTC ATC GGC ACT GGA TAC GCC CGT CCC GTC GCC GCO -3* 

Ala ' Gly Gly Gly Gly Ala Ala Ala 

4 4. 
Ala Ala 

pr = primer = SEQ ID NO: 22 

cs = coding strand = SEQ ID NO: 23 



[0068] As a result of the mutagenesis with the described primer, plasmid pUR2986 will be obtained. From this plasmid 
the DNA coding for the presumably inactivated cholesterol oxidase can be released as a 1527 bp fragment through 
50 Not\INhe\ digestion, and subsequently directly used to exchange the scFv-Lys coding sequence in pUR4175, thereby 
generating plasmid pU R2987 (see Figure 1 4). To obtain a variant yeast secretion vector, where the secretion is directed 
through the SUC2 signal sequence, for example the 1823 bp long Sac\INhe\ segment of plasmid pUR2986 can be 
used to replace the Sac\INhe\ fragment in pUR4174. 

[0069] This inactivation of the FAD-binding site might be preferable over other mutations, since an unchanged active 
55 centre can be expected to leave the binding properties of cholesterol oxidase for cholesterol unaltered. Instead of the 
described Gly-Ala exchanges at position 18 and 20 of the mature coding sequence, every other suitable amino acid 
change can also be performed. 

[0070] To inactivate the enzyme, site directed mutagenesis can be optionally immediately performed in the active 



11 



EP0 682 710 B1 

site cavity, for example through exchange of the Glu331 , a residue appropriately positioned to act as the proton ac- 
ceptor, thus generating a new variant of an immobilized, enzymatically inactive fusion protein. 

Example 7. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lactic 
acid bacterium and is able to bind cholesterol. 

[0071] It has been described that proteinase of Lactococcus Iactis subsp. cremoris is anchored to the cell wall through 
*s 127 ammo acid long C-terminal, see Kok c.s. (1988) and Kok (1990). In a way similar to that described in Example 
6, the cholesterol oxidase of Brevibacterium sterolicum (choB) can be immobilized on the surface of Lactococcus Iactis 
Fusions can be made can be made between the choB structural gene and the N-terminal signal sequence and the C- 
terminal anchor of the proteinase of Lactococcus lactic. Plasmid pGKV550 (see Figure 15) contains the complete 
proteinase operon of Lactococcus Iactis subsp. cremoris Wg2, including the promoter, a ribosome binding site and 
^9™^ encoding the already mentioned signal and anchor sequences, see Kok (1990). First a DNA fragment 
containing the main part of the signal sequence, flanked by a C/al site and an Eagl site can be constructed with PCR 
on pGKV550 as follows: 

Primer prtl: 

5»-AA GAT CTA ?CG ftyc TTG TTA GCC GGT ACA-3 • = SEQ ID NO: 24 

Proteinase gene (non coding strand): 

3'-TT CCC GA T AGC TAG AAC AAT CGG CCA TGT CAG-5' 

CIaI = SEQ ID NO: 25 



Proteinase gene: Gin Ala Lys 

5 » -GTC GGC GAA ATC CAA GCA AAG GCG GCT-3 ■ = SEQ ID NO' 26 

Primer pn2: = SEQ ID NO: 27 

3 ■ -CAG CCG CTT TAG GTT CGT TGC_C£G_CCC CCC TTC GAA CCC-5 ■ 

Eagl Hindlll 



[0072] After the PCR reaction as described in Example 6, the 98 bp long PCR fragment can be isolated and digested 
with C/al and HmdWl pGKV550 can subsequently be cleaved partially with C/al and completely with Hind\\\, after which 
digestions the vector fragment, containing the promoter, the ribosome binding site, the DNA fragment encoding the N- 
terminal 8 amino acids and the cell wall binding fragment containing the 1 27 C-terminal amino acids of the proteinase 
gene can be isolated on gel. 

[0073] A copy of the cholesterol oxidase gene, suitable for fusion with the prtP anchor domain can be produced by 
a PCR reaction using plasmid pUR2985 (Example 6) as template and a combination of primer choOlpcr (see Example 
6) and the following primer cho03pcr instead of primer cho02pcr: 

cho03pcr H/ndlH 

3 • -TAG TAG AGC AGO CTG TAG GTC CGA G TT CCA A CC TAG GC-5 • = SEQ ID NO" 40 

• • i tii iii iii iii ill III 
5 1 -ATC ATC TCG TCC GAC ATC CAG = SEQ ID NO: 20. 

[0074] The about 1 .53 kbp fragment generated by this reaction can be digested with Not\ and H/ndlll to produce a 
molecule which can subsequently be ligated with the large Eag\/Hin6\\\ fragment from pUR2988 (see Figure 16) The 
resulting plasmid, pUR2989, will contain the cholesterol oxidase coding sequence inserted between the signal se- 
quence and the C-terminal cell wall anchor domain of the proteinase gene. 

[0075] After introduction into Lactobacillus Iactis subsp. Iactis MG1363 by electroporation, this plasmid will express 
cholesterol oxidase under control of the proteinase promoter. The transport through the membrane will be mediated 
by the proteinase signal sequence and the immobilization of the cholesterol oxidase by the proteinase anchor As it is 
unlikely that the Lactococcus will secrete FAD as well, the cholesterol oxidase will not be active but will be capable to 
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bind cholesterol. 

Example 8. Construction of a gene encoding a chimeric protein that will be anchored in the cell wall of a lower 
eukaryote and is able to bind growth hormones, such as the epidermal growth factor. 

5 

[0076] For the isolation of larger amounts of human epidermal growth factor (EGF) the corresponding receptor can 
be used in form of a fusion between the binding domain and a C-terminal part of a-agglutinin as cell wall anchor. The 
complete cDNA sequence of the human epidermal growth factor is cloned and sequenced. For the construction of a 
fusion protein with EGF binding capacity the N-terminal part of the mature receptor until the central 23 amino acids 

10 transmenbrane region can be utilized. 

[0077] The plasmid pUR4175 can be used for the construction. Through digestion with Eag\ and Nhe\ (partial) a 731 
bp DNA fragment containing the sequence coding for scFv is released and can be replaced by a DNA fragment coding 
for the first 621 amino acids of human epidermal growth factor receptor. Initiating from an existing human cDNA library 
or otherwise through production of a cDNA library by standard techniques from preferentially EGF receptor overex- 

15 pressing cells, e.g. A431 carcinoma cells, see Ullrich c.s. (1984), further PCR can be applied for the generation of in 
frame linkage between the extracellular binding domain of the human growth factor receptor (amino acid 1-622) and 
the C-terminal part of a-agglutinin. 

[0078] PCR oligonucleotides for the in frame linkage of human epidermal growth factor receptor and the C-terminus 
of a-agglutinin. 

20 

a: PCR oligonucleotides for the transition between SUC2 signal sequence and the N-terminus of mature EGF 
receptor. 

25 > mature EGF receptor 

pri EGF1: Ala Leu Glu Lys Lys Val - SEQ ID NO: 28 

5 1 -GGG GCG GCC GC G CTG GAG GAA AAG AAA GTT TGC-3 1 

m>*+>i (it lit iii iii iii iii 1 1 i 

NOtl til (is ill Mi til iii lit 

3'-CGC TCA GCC CGA GAC CTC CTT TTC TTT CAA ACG 5« 

EGF rec (non-coding strand): = SEQ ID NO: 29 



30 



35 



40 



b: PCR oligonucleotides for the in frame transition between C terminus of the extracellular binding domain of EGF 
receptor and the C terminal part of a-agglutinin. 



Ala Thr 

GCC ACT- 3 1 = SEQ ID NO: 30 
= SEQ ID NO: 31 



EGF rec 


(coding strand): 








Asn 


Gly Pro 


He 


Pro 


Ser 


5'-AAT 


GGG CCT AAG 


ATC 


CCG 


TCC 


t i i 
i i i 


iii iii iii 
iii »tt iii 


i i t 
i i i 


i i i 

i i i 


t t t 
i i i 


3 f -TTA 


CCC GGA TTC 


TAG 


GGC 


AGG 



pr EGF2: Nhel Hindlll 

45 [0079] This fusion would result in an addition of 2 Ala amino acids between the signal sequence and the mature N- 
terminus of EGF receptor. 

[0080] The newly obtained 1.9 kbp PCR fragment can be digested with Notl and Nhe\ and directly ligated into the 
vectorpUR4175 after digesting with the same enzymes, resulting in plasmid pl)R2993 (see Figure 17), comprising the 
GAL7 promoter, the prepro-a-mating factor sequence, the chimeric EGF receptor binding domain gene / a-agglutinin 
50 gene, the yeast 2 urn sequence, the defective LEU2 promoter and the LEU2 gene. This plasmid can he transformed 
into S. cerevisiae and the transformed cells can be cultivated in YP medium whereby expression of the chimeric protein 
can be induced by adding galactose to the medium. 

Example 9. Construction of genes encoding a chimeric protein anchored to the cell wall of yeast, comprising 
55 a binding domain of a "Camelidae" heavy chain antibody' 

[0081] Recently it was described that camels as well as a number of related species (e.g. lamas) contain a consid- 
erable amount of IgG antibody molecules which are only composed of heavy-chain dimers, see Hamers-Casterman 
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c s (1993) Although these "heavy-chain" antibodies are devoid of light chains, it was demonstrated, that they never- 
theless have an extensive antigen-binding repertoire. In order to show that the variable regions of this type ofantibod.es 
can be produced and will be linked to the exterior of the cell wall of a yeast, the following constructs were prepared. 

5 Construction of pUR2997, pUR2998 and pUR2999 

[0082] The about 2.1 kbp Eagl-H/ndlll fragment of pUR41 77 (Example 4, Fig 9) was isolated. By using PCR tech- 
nology an EcoRI restriction site was introduced immediately upstream of the Eagl site, whereby the C of the EcoRI 
site is the same as the first C of the Eagl site. The thus obtained EcoRI-H/ndlll fragment was ligated .nto plasmid 
w P EMBL9 which was digested with EcoRI and H/ndlll, which resulted in pUR4177.A 

[00831 The EcoR\INhe\ fragment of plasmid pUR4177.A was replaced by the EcoRUNhel fragments of three Afferent 
synthetic DNA fragments (SEQ ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34) resulting in pUR2997. pUR2998 and 
PUR2999. respectively. The about 1.5 kbp SsfE!l-H/ndlll fragments of pUR2997 and pUR2998 were isolated. 

15 Construction of pUR4421 

[0084] The multiple cloning site of plasmid pEMBL9, see Dente c.s. (1983), (ranging from the EcoRI to the H/ndlll 
site) was replaced by a synthetic DNA fragment having the nucleotide sequence given below, see SEQ ID NO: 35 
giving the coding strand and SEQ ID NO: 36 giving the non-coding strand. The 5'-part of this nucleotide sequence 

20 comprises an Eagl site, the first 4 codons of a Came/Wae V H gene fragment (nucleotides 16-27) and a Xhol site 
(CTCGAG) coinciding with codons 5 and 6 (nucleotides 28-33). The 3'-part comprises the last 5 codons of the Camel- 
idae V H gene (nucleotides 46-60) (part of which coincides with a BsfEII site), eleven codons of the Myc tail (nucleoUdes 
61-93) see SEQ ID NO: 35 containing these eleven codons and SEQ ID NO: 37 giving the amino ac.d sequence and 
an EcoRI site (GAATTC). The EcoRI site, originally present in pEMBL9, is not functional any more, because the 5 - 

« end of the nucleotide sequence contains AATTT instead of AATTC, indicated below as (EcoRI). The resulfng plasm.d 
is called pUR4421 . The Camelidae V H fragment starts with amino acids Q-V-K and ends w.th amino acids V-S-S. 



30 



35 



40 



55 



(EcoRI) Eagl Xhol BstEII 

5 ' - AATTT AG CGG CCGCCCAGGT GAAACTGCTC SAGTAAGTGA CTAAG£TJ££- SO 
3. i~ A TCGCC GGCGGGTCCA CTTTGACGAG CTCATTCACT GATTCCAGTG- 
5 Q V K 

-CGTCTCCTCA GAACAAAAAC TCATCTCAGA AGAGGATCTG AATTAATGAG- IOO 
-GCAGAGGAGT CTTGTTTTTG AGTAGAGTCT TCTCCTAGAC TTAATTACTC- 
VSS E Q K LISE EDL N * * 

= SEQ ID NO: 37 

EcoRI Hindlll 

— AATTCA TCAA ACGGTGATA -3 ' 119 = SEQ ID NO: 35 

-TTAAGTAGTT TGCCACTATT CGA -5' 123 = SEQ ID NO; 36 



Construction of pUR4424 

45 [0085] After digesting the plasmid pB09 with Xhol and BstEII, a DNA fragment of about 0.34 kbp was isolated from 
agarose gel. This fragment codes for a truncated V H fragment, missing both the first 4 and the last 5 am.no ac.ds of 
the Camelidae V H fragment. Plasmid pB09 was deposited as E. coll JM1 09 P B09 at the Centraal Bureau voor Sch.m- 
melcultures. Baam on 20 April 1993 with deposition number CBS 271 .93. The DNA and amino acid sequences of the 
Camel V H fragments followed by the Flag sequence as present in plasmid pB09 were given in Figure 6B of European 

50 patent application 93201239.6 (not yet published), which is herein incorporated by reference. The obtained about 
0 34 kbp fragment was cloned into pUR4421 . To this end plasmid pUR4421 was digested with Xhol and Mndlll. after 
which the about 4 kb vector fragment was isolated from an agarose gel. The resulting vector was ligated w.th the about 
0.34 kbp X/iol/BsfEII fragment and a synthetic DNA linker having the following sequence: 



BstEII Hindill „ 

GTCACCGTCTCCTCATAATGA - SEQ ID MO: 38 

GCAGAGGAGTATTACTTCGA " SEQ ID NO. 39 
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resulting in plasmid pUR4421-09. 

[0086] Plasmid pSY16 was digested with Eagl and H/ndlll, atter which the about 6.5 kbp long vector backbone was 
isolated and ligated with the about 0.38 kbp Eagl/H/ndlll fragment from pUR4421-09 resulting in pUR4424. 

5 Construction of pUR4482 and pUR4483 

[0087] From pUR4424 the about 0.44 kbp Sact-BsfEll fragment, coding for the invertase signal sequence and the 
camel heavy chain variable 09 (= CH V 09) fragment, was isolated as well as the about 6.3 kbp Sacl-H/ndlll vector 
fragment. The about 6.3 kbp fragment and the about 0.44 kbp fragment from pUR4424 were ligated with the BstEU-Hin- 

w dill fragment from pUR2997 or pUR2998 yielding pUR4482 and pUR4483, respectively. 

[0088] Plasmid pUR4482 is thus an yeast episomal expression plasmid for expression of a fusion protein with the 
invertase signal sequence, the CH V 09 variable region, the Myc-tail and the Camel "X-P-X-P" Hinge region, see Harri- 
ers -Casterman c.s. (1993), (1993), and the a-agglutinin cell wall anchor region. Plasmid pUR4483 differs from 
pUR4482 in that it contains the Myc-tail hut not the n X-P-X-P M Hinge region. Similarly, the BsfEII-H/indlll fragment from 

15 pUR2999 can be ligated with the about 6.3 kbp vector fragment and the about 0.44 kbp fragment from pUR4424, 
resulting in pUR4497, which will differ from pUR4482 in that it contains the "X-P-X-P" Hinge region but not the Myc-tail. 
[0089] The plasmids pUR4424, pUR4482 and pUR4483 were introduced into Saccharomyces cerevisiae SU10 by 
electroporation, and transformants were selected on plates lacking leucine. Transformants from SU10 with pUR4424, 
pUR4482 or pUR4483, respectively, were grown on YP with 5% galactose and analysed with immune-fluorescence 

20 microscopy, as described in Example 1 of our co-pending WO-94/01 567 (UNILEVER) published on 20 January 1994. 
This method was slightly modified to detect the chimeric proteins, containing both the camel antibody and the Myc tail, 
present at the cell surface. 

[0090] In one method a monoclonal mouse anti-Myc antibody was used as a first antibody to bind to the Myc part of 
the chimeric protein; subsequently a polyclonal antimouse 1g antiserum labeled with fluorescein isothiocyanate (= 
25 FITC) ex Sigma, Product No. F-0527, was used to detect the bound mouse antibody and a positive signal was deter- 
mined by fluorescence microscopy. 

[0091] In the other method a polyclonal rabbit anti-human IgG serum, which had earlier been proven to cross-react 
with the camel antibodies, was used as a first antibody to bind the camel antibody part of the chimeric protein; subse- 
quently a polyclonal anti-rabbit 1g antiserum labeled with FITC ex Sigma, Product No. F-0382, was used to detect the 
30 bound rabbit antibody and a positive signal was determined by fluorescence microscopy. 

[0092] The results in Figure 1 9 and Figure 20 show clearly that fluorescence can be observed on those cells in which 
a fusion protein of the CH V 09 fragment with the a-agglutinin cell wall anchor region is produced (pUR4482 and 
pUR4483). No fluorescence however, was visible on the cells which produce the CH V 09 fragment without this anchor 
(pUR4424), when viewed under the same circumstances. 

35 

Patent literature references: 

[0093] EP-A1 -02551 53 (UNILEVER) Production of guar alpha-galactosidase by hosts transformed by recombinant 
DNA methods. First priority date 03.06.86; published 03.02.88 
40 [0094] WO-91/00920 (UNILEVER) Process for preparing a protein by a fungus transformed by multicopy integration 
of an expression vector. First priority date 07.07.89; published 24.01.91 

[0095] WO-91/19782 (UNILEVER) Xylanase production. Priority date 19.06.90; published 26.12.91 
[0096J WO-94/01 567 (UNILEVER) Process for immobilizing enzymes to the cell wall of a microbial cell by producing 
a fusion protein. First priority date 08.07.92; published 20.01.94 
45 [0097] EP patent application 93201 239.6 (not yet published) Production of antibodies or (functionalized) fragments 
thereof derived from heavy chain immunoglobulins of Camelidae. Filing date 29.04.93 

Non-patent literature references: 

50 [0098] R.E. Bird & 8. Webb Walker Single chain antibody variable regions. TIBTECH 9 (April 1991) 132-137 

A. Conzelmann, C. Fankhauser & C. Desponds Myoinositol gets incorporated into numerous membrane glycopro- 
teins of Saccharomyces cerevisiae; incorporation is dependent on phosphomannomutase (SEC53). The EMBO Journal 
9, No.3 (1990) 653-661 

L. Dente, G. Cesareni & R. Cortese pEMBL: a new family of single stranded plasmids. Nucleic Acids Research 1J_. 
55 No. 6(1983) 1645-1655 

E. Erhart, & CP. Hollenberg The Presence of a Defective Leu2 Gene on 2u DNA Recombinant Plasmids of Saccha- 
romyces cerevisiae is Responsible for Curing and High Copy Number. Journal of Bacteriology. 156, No.2 (Nov. 1983) 
625-635 



15 



EP 0 682 710 B1 



C. Hamers-Casterman, T. Atarhouch, S.Muyldermans, G. Robinson, C. Hamers, E. Bajyana Songa, N. Bendah- 
man & R. Hamers Naturally occurring antibodies devoid of light chains. Nature 363 (3 June 1 993) 446-448. 

M.M. Harmsen, A.C. Langedijk, E. van Tuinen, R.H. Geerse, H.A. Raue, & J. Maat Effect of pmrl disruption and 
different signal sequences on the intracellular processing and secretion of Cyamopsis tetragonoloba a-galactosidase 
5 by Saccharomyces cerevisiae. Gene 125 (1993) 115-123 

T. Ishizaki, N. Hirayama, H. Shinkawa, O. Nimi, & Y. Murooka Nucleotide Sequence of the Gene for Cholesterol 
Oxidase from a Streptomyces sp. Journal of Bacteriology 171, No.1 (Jan. 1989) 596-601 

J. Kok, K.J. Leenhouts, A.J. Haandrikman, A.M. Ledeboer.& G. Venema Nucleotide Sequence of the Cell Wall 
Proteinase Gene of Streptococcus cremoris Wg2. Applied and Environmental Microbiology 54, No. 1 (Jan. 1988) 
10 231-238 

J. Kok Genetics of the proteolytic system of lactic acid bacteria. FEMS Microbiology Reviews. 87 (1990) 15-42, esp. 
item 3.2 and Figure 7 on pages 22-23 R. Lewin The universal constructor set. New Scientist (8 December 1 990) 30-33 
P.N. Lipke, D. Wojciechowicz & J. KurjandGctf Is the Structural Gene for the Saccharomyces cerew's/aea-Agglutinin, 
a Cell Surface Glycoprotein Involved in Cell-Cell Interactions during Mating. Molecular and Cellular Biology. 9, No.8 
is (Aug. 1989)3155-3165. 

D. A. Mead, E. Szczesna-Skorupa & B. Kemper Single-stranded DNA 'blue* T7 promoter plasmids: a versatile tandem 
promoter system for cloning and protein engineering. Protein Engineering t No. 1 (1986) 67-74 

R. Orlandi, D.H. Giissow, P.T. Jones, & G. Winter Cloning immunoglobulin variable domains for expression by the 
polymerase chain reaction. Proc. Natl. Acad. Sci. USA 86 (May 1989) 3833-3837 
20 B.C. Ossendorp The non-specific tipid-transfer binding protein: a recombinant DNA and immunological approach. 
Thesis University of Utrecht (30 September 1992) T. Ohta, K. Fujishiro, K. Yamaguchi, Y. Tamura, K. Aisaka, T. 
Uwajima & M. Hasegawa Sequence of gene choB encoding cholesterol oxidase of Brevibacterium sterolicum: com- 
parison with choA of Streptomyces sp. SA-COO. Gene 103 (1991 ) 93-96 

A. Roy, C.F. Lu, D.L. Marykwas, P.N. Lipke & J. Kurjan The AGA1 Product Is Involved in Cell Surface Attachment 
25 of the Saccharomyces cerevisiae Cell Adhesion Glycoprotein a-Agglutinin. Molecular and Cellular Biology 11 No.8 
(Aug. 1991)4196-4206 

M.P. Schreuder, S. Brekelmans, H. van den Ende, & F.M. Klis Targeting of a Heterologous Protein to the Cell Wall 
of Saccharomyces cerevisiae. Yeast 9 (1993) 399-409 

A. Ullrich, L. Coussens, J.S. Hayflick, T.J. Dull, A. Gray, A.W. Tarn, J. Lee, Y. 

30 Yarden, T.A. Libermann, J. Schiessinger, J. Downward, E.L.V. Mayes, N. Whittle, 

M.D. Waterfield & P.H. Seeburg Human epidermal growth factor receptor cDNA sequence and aberrant expression 
of the amplified gene in A431 epidermoid carcinoma cells. Nature 309 (31 May 1984) 418-425 
J.M.A. Verbakel Heterologous gene expression in the yeast Saccharomyces cerevisiae. Thesis University of Utrecht 
(1 May 1991), esp. pages 76-89 

35 A. Vrielink, L.F. Lloyd, & D.M. Blow Crystal Structure of Cholesterol Oxidase from Brevibacterium sterolicum Refined 
at 1 .8 A Resolution. J. Mol. Biol., 219 (1991 ) 533-554 

E. S. Ward, D. Giissow, A.D. Griffiths, P.T. Jones & G. Winter. Binding activities of a repertoire of single immu- 
noglobulin variable domains secreted from E. coli. Nature 341_ (12 October 1989) 544-546 

[0099] Information on a deposit of a micro-organism under the Budapest Treaty is given on page 26, lines 5-7 above . 
40 in agreement with Rule 28 (4) EPC, or a similar arrangement for a State not being a Contracting State of the EPC, it 
is hereby requested that a sample of such deposit, when requested, will be submitted to an expert only. 

SEQUENCE LISTING 
45 [0100] 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

50 

(A) NAME: Unilever N.V. 

(B) STREET: Weena 455 

(C) CITY: Rotterdam 

(E) COUNTRY: The Netherlands 
55 (F) POSTAL CODE (ZIP): NL-301 3 AL 

(A) NAME: Unilever PLC 

(B) STREET: Unilever House Blackfriars 
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(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): EC4P 4BQ 

(A) NAME: Leon Gerardus J. FRENKEN 

(B) STREET: Geldersestraat 90 

(C) CITY: Rotterdam 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-3011 MP 

(A) NAME: Pieter DE GEUS 

(B) STREET: Boeier 24 

(C) CITY: Barendrecht 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-2991 KB 

(A) NAME: Franciscus Maria KLIS 

(B) STREET: Benedenlangs 102 

(C) CITY: Amsterdam 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-1025 KL 

(A) NAME: Holger York TOSCHKA; c/o Langnese Iglo, BR3 

(B) STREET: Aeckern 1 

(C) CITY: REKEN 

(E) COUNTRY: Germany 

(F) POSTAL CODE (ZIP): D-48734 

(A) NAME: cornelis Theodorus VERRIPS 

(B) STREET: Hagedoorn 18 

(C) CITY: Maassluis 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP): NL-3142 KB 

(ii) TITLE OF INVENTION: Immobilized proteins with specific binding capacities and their use in processes 
and products, 

(iii) NUMBER OF SEQUENCES: 40 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 
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(B) CLONE: fragment in pUR4119 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

5 



10 



GAATTCGAGC 


TCATCACACA 


AACAAACAAA 


ACAAAATGAT 


GCTTTTGCAA 


GCCTTTCTTT 


60 


TCCTTTTGGC 


TGGTTTTGCA 


GCCAAAATAT 


CTGCGCAGGT 


CCAGCTGCAG 


TAATGAACCA 


120 


CGGTCACCGT 


CTCCTCACGT 


GGAGGCGGTT 


CAGGCGGAGG 


TGGCTCTGGC 


GGTGGCGGAT 


180 


CGGACATCGA 


GCTCACTCAG 


ACCAAGCTCG 


AGATCAAACG 


GTGATAAGCT 


T 


231 



(2) INFORMATION FOR SEQ ID NO: 2: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
25 (vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Xhol-Nhel coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

30 

TCCAGATCAA AGGCCCATCT O 21 

(2) INFORMATION FOR SEQ ID NO: 3: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
45 (vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Xhol-Nhel non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

50 

CTAGCACATC CCCCTTTGAT C 21 



(2) INFORMATION FOR SEQ ID NO: 4: 

55 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Eagl-Pstl coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGCCGCCCAG GTGCAGCTGC A 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker Eagl-Pstl non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTGCACCTG GGC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: PCR primer A (heavy chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AG GTSMARCT GCAGSAGTCW GO 

(2) INFORMATION FOR SEQ ID NO: 7: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: PGR primer B (heavy chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TGAGGACACG GTCACCGTGG TCCCTTGCCC CC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: PCR primer C (light chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GACATTGAGC TCACCCAGTC TCCA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B). CLONE: PCR primer D (light chain) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTTTCATCTC GAGCTTGGTC CC 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker EcoRI-Patl coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AATTCCCCCO TTCACCTCCA CCTGCA 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: linker EcoRI-Pstl non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCTGCACCTG AACCCCCC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: ScFv antitraseolide 02/01/01 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
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CTGCAGGAGT CTGGACCTGG CCTGGTCAAA CCTTCTCAGT CTCTGTCCCT CACCTGCACT 60 

GTCACTGGCT ACTCAATCAC CAGTGATTTT GCCTGGAACT GGATCCGGCA CTTTCCAGGA 120 

5 AACCAACTCG AGTGGATGGG CTACATAAGC TACAGTGCTA GCACTACCTA CAACCCATCT 180 

CTCAAAAGTC GAATCTCTCT CACTCGAGAC ACATCCAAGA ACCAGTTCTT CCTGCAGTTO 240 

AATTCTGTGA CTACTGAGGA CACAGCCACA TATTACTGTG CAACGTCCCT AACATGCTTA 300 

10 CTACGTCGGA AACGTTCTTA CTGGGGCCAA GGGACCACGG TCACCGTCTC CTCAGGTGCA 360 

GGCGGTTCAG GCGGAGGTGG CTCTGGCGGT GGCGGATCGG ACATCGAGCT CACCCAGTCT 420 

CCATCCTCCA TGTCTCTATC TCTGGGAGAC ACAGTCAGCA TCACTTGCCA TGCAACTCAG 480 

15 GACATTAGCA GTAATATAGG GTGGTTGCAG CAGAAACCAG GGAAATCATT TAAGGGCCTG S40 

ATCTATCATG GAACCAACTT GGAAGATGGT ATTCCATCAA GGTTCAGTGG CAGTGGATCT 600 

GGAGCAGATT ATTCCCTCAC CATCAGCACC CTGGAATCTG AACATTTTGC AG ACT ATT AC 660 

20 TGTGTACAGT ATGCTCAGTT TCCATTCACG TTCGGCTCGG GGACCAAGCT CGAG 714 

(2) INFORMATION FOR SEQ ID NO: 13: 
25 (j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

35 

(B) CLONE: ScFv anti-HCG 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

40 



45 



50 



55 
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CCCCCGTTCA GCTCCACCTC CAGGACTCTG GGCGACACTT AGTGAAGCCT GCAGGGTCOC 
TGAAACTCTC CTGTGCAGCC TCTGGATTCG CTTTCAGTAG CTTTGACATG TCTTGGATTC 
GCCAGACTCC GGAGAAGAGG CTGGAGTGGG TCGCAAGCAT TACTAATCTT GCTACTTACA 
CCTACTATCC AGGCAGTGTG AAGGGCCGAT TCTCCATCTC CAGACACAAT CCCACGAACA 
CCCTAAACCT GCAAATGAGC AGTCTGAGGT CTGAGGACAC GCCCTTGTAT TTCTGTCCAA 
GACAGGGGAC TGCGGCACAA CCTTACTGGT ACTTCGATCT CTGGGGCCAA GGGACCAOGG 
TCACCGTCTC CTCAGGTGGA GGCGGTTCAG GCGGAGGTGG CTCTGGCGGT GGCGGATOGG 
ACATCGAGCT CACCCAGTCT CCAAAATCCA TGTCCATGTC CGTAGC AG AG AGGGTCACCT 
TGAGCTCCAA GGCCAGTGAG ACTGTGGATT CTTTTGTGTC CTGGTATCAA CAGAAACCAG 
AACAGTCTCC TAAATTGTTG ATATTCGGGG CATCCAACCG GTTCAGTGGG GTCCCCGATC 
GCTTCACTGG CAGTGGATCT GCAACAGACT TCACTCTGAC CATCAGCAGT GTGCAGGCTG 
AGGACTTTGC GGATTACCAC TGTGGACAGA CTTACAATCA TCCGTATACG TTCGGAGGGG 
GGACCAAGCT CGAG 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pYY105 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..2685 

(D) OTHER INFORMATION: /product= "Flocculation protein" /gene= M FL01 W 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
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ATG ACA ATG CCT CAT CGC TAT ATG TTT TTG GCA GTC TTT ACA CTT CTG 48 
Met Thr Met Pro His Arg Tyr Met Phe Leu Ala Val Phe Thr Leu Leu 
1 5 10 15 

5 GCA CTA ACT AGT CTG CCC TCA GGA GCC ACA GAG GCG TGC TTA CCA GCA 96 

Ala Leu Thr Ser Val Ala Ser Gly Ala Thr Glu Ala Cys Leu Pro Ala 
20 25 30 

GGC CAG AGG AAA AGT GGG ATG AAT ATA AAT TTT TAC CAG TAT TCA TTG 144 
Gly Gin Arg Lye Ser Gly Met Asn lie Asn Phe Tyr Gin Tyr Ser Leu 
10 35 40 45 

AAA GAT TCC TCC ACA TAT TCG AAT GCA GCA TAT ATG GCT TAT GGA TAT 192 

Lys Asp Ser Ser Thr Tyr Ser Asn Ala Ala Tyr Met Ala Tyr Gly Tyr 
50 55 60 

15 GCC TCA AAA ACC AAA CTA GGT TCT GTC GGA GGA CAA ACT GAT ATC TCG 240 

Ala Ser Lys Thr Lys Leu Gly Ser Val Gly Gly Gin Thr Asp lie Ser 
65 70 75 80 

ATT GAT TAT AAT ATT CCC TGT GTT ACT TCA TCA GCC ACA TTT CCT TGT 288 
Zle Asp Tyr Aen lie Pro Cye Val Ser Ser Ser Gly Thr Phe Pro Cys 
20 85 90 95 

CCT CAA GAA GAT TCC TAT GGA AAC TGG GGA TGC AAA GGA ATG GGT GCT 336 
Pro Gin Glu Asp Ser Tyr Gly Asn Trp Gly Cys Lys Gly Met Gly Ala 
100 105 110 

25 TGT TCT AAT AGT CAA GGA ATT GCA TAC TGG AGT ACT CAT TTA TTT GGT 384 

Cys Ser Asn Ser Gin Gly lie Ala Tyr Trp Ser Thr Asp Leu Phe Gly 
115 , 120 125 

TTC TAT ACT ACC CCA ACA AAC GTA ACC CTA GAA ATG ACA GGT TAT TTT 432 
Phe Tyr Thr Thr Pro Thr Asn Val Thr Leu Glu Met Thr Gly Tyr Phe 
30 130 135 140 

TTA CCA CCA CAG ACG GGT TCT TAC ACA TTC AAG TTT CCT ACA GTT GAC 480 

Leu Pro Pro Gin Thr Gly Ser Tyr Thr Phe Lys Phe Ala Thr val Asp 

145 150 155 160 

35 GAC TCT GCA ATT CTA TCA GTA GGT GGT GCA ACC GCG TTC AAC TGT TGT 528 

Asp Ser Ala He Leu Ser Val Gly Gly Ala Thr Ala Phe Asn Cys Cys 
165 170 175 



40 



45 



50 



55 
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GCT CAA CAG CAA CCG CCG ATC ACA TCA ACG AAC TTT ACC ATT GAC GGT 576 

Ala Gin Gin Gin Pro Pro lie Thr Ser Thr Aan Phe Thr He Asp Gly 

180 185 190 

5 ATC AAG CCA TGG GGT GGA AGT TTG CCA CCT AAT ATC CAA GGA ACC GTC 624 

He Lys Pro Trp Gly Gly Ser Leu Pro Pro Asn He Glu Gly Thr Val 
195 200 205 

TAT ATG TAC CCT GGC TAC TAT TAT CCA ATG AAG GTT GTT TAC TCG AAC 672 

Tyr Met Tyr Ma Gly Tyr Tyr Tyr Pro Met Lys Val Val Tyr Ser Asn 
10 210 215 220 

GCT GTT TCT TGG GGT ACA CTT CCA ATT AGT GTG ACA CTT CCA GAT GGT 720 
Ala Val Ser Trp Gly Thr Leu Pro He Ser Val Thr Leu Pro Asp Gly 
225 230 235 240 

15 ACC ACT GTA AGT GAT GAC TTC GAA GGG TAC GTC TAT TCC TTT GAC GAT 766 

Thr Thr Val Ser Asp Asp Phe Glu Gly Tyr Val Tyr Ser Phe Asp Asp 
245 250 255 

GAC CTA AGT CAA TCT AAC TGT ACT GTC CCT GAC CCT TCA AAT TAT GCT 816 
Asp Leu Ser Gin Ser Asn Cys Thr Val Pro Asp Pro Ser Asn Tyr Ala 
20 260 265 270 

GTC AGT ACC ACT ACA ACT ACA ACG GAA CCA TGG ACC GGT ACT TTC ACT 864 

Val Ser Thr Thr Thr Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr 
275 280 285 

TCT ACA TCT ACT GAA ATG ACC ACC GTC ACC GGT ACC AAC GGC GTT CCA 912 

25 Ser Thr Ser Thr Glu Met Thr Thr Val Thr Gly Thr Asn Gly Val Pro 

290 295 300 

ACT GAC GAA ACC GTC ATT GTC ATC AGA ACT CCA ACC AGT GAA GGT CTA 960 

Thr Asp Glu Thr Val He Val He Arg Thr Pro Thr Ser Glu Gly Leu 

305 310 315 320 

30 

ATC AGC ACC ACC ACT GAA CCA TGG ACT GGC ACT TTC ACT TCG ACT TCC 1008 
He ser Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser 
32S 330 335 

ACT GAG GTT ACC ACC ATC ACT GGA ACC AAC GGT CAA CCA ACT GAC GAA 1056 
,c Thr Glu val Thr Thr lie Thr Gly Thr Asn Gly Gin Pro Thr Asp Glu 

340 345 350 

ACT GTG ATT GTT ATC AGA ACT CCA ACC AGT GAA GGT CTA ATC AGC ACC 1104 
Thr Val He Val He Arg Thr Pro Thr Ser Glu Gly Leu He Ser Thr 
355 360 365 

40 ACC ACT GAA CCA TGG ACT GGT ACT TTC ACT TCT ACA TCT ACT GAA ATG 1152 

Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met 
370 375 380 

ACC ACC GTC ACC GGT ACT AAC GGT CAA CCA ACT GAC GAA ACC GTC ATT 1200 

Thr Thr Val Thr Gly Thr Asn Gly Gin Pro Thr Asp Glu Thr Val He 

45 385 390 395 400 

GTT ATC AGA ACT CCA ACC AGT GAA GGT TTG GTT ACA ACC ACC ACT GAA 1248 
Val He Arg Thr Pro Thr Ser Glu Gly Leu Val Thr Thr Thr Thr Glu 
405 410 415 

50 CCA TGG ACT GCT ACT TTT ACT TCG ACT TCC ACT GAA ATG TCT ACT GTC 1296 

Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met Ser Thr Val 
420 425 430 

ACT GGA ACC AAT GGC TTG CCA ACT GAT GAA ACT GTC ATT GTT GTC AAA 1344 
Thr Gly Thr Asn Gly Leu Pro Thr Asp Glu Thr Val He Val Val Lys 
55 435 440 445 
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ACT CCA ACT ACT GCC ATC TCA TCC ACT TTG TCA TCA TCA TCT TCA GGA 
Thr Pro Thr Thr Ala lie Sec Ser Ser Leu Ser Ser Ser Ser Ser Gly 
450 455 460 

CAA ATC ACC AGC TCT ATC ACC TCT TCG CGT CCA ATT ATT ACC CCA TTC 
Gin He Thr Ser Ser He Thr Ser Ser Arg Pro He He Thr Pro Phe 
465 470 475 480 

TAT CCT AGC AAT GGA ACT TCT GTG ATT TCT TCC TCA GTA ATT TCT TCC 
Tyr Pro Ser Asn Gly Thr Ser Val He Ser Ser Ser Val He Ser Ser 
485 490 495 

TCA GTC ACT TCT TCT CTA TTC ACT TCT TCT CCA GTC ATT TCT TCC TCA 

Ser val Thr Ser Ser Leu Phe Thr Ser Ser Pro Val He Ser Ser Ser 
500 505 510 

GTC ATT TCT TCT TCT ACA ACA ACC TCC ACT TCT ATA TTT TCT GAA TCA 
Val He Ser Ser Ser Thr Thr Thr Ser Thr ser He Phe Ser Glu Ser 
515 520 525 

TCT AAA TCA TCC GTC ATT CCA ACC AGT AGT TCC ACC TCT GGT TCT TCT 
Ser Lys Ser Ser Val He Pro Thr Ser Ser Ser Thr Ser Gly Ser ser 
530 535 540 

GAG AGC GAA ACG AGT TCA CCT GCT TCT GTC TCT TCT TCC TCT TTT ATC 
Glu ser Glu Thr Ser Ser Ala Gly Ser Val ser Ser Ser Ser Phe He 
545 550 555 560 

TCT TCT GAA TCA TCA AAA TCT CCT ACA TAT TCT TCT TCA TCA TTA CCA 
ser Ser Glu ser Ser Lys Ser Pro Thr Tyr Ser Ser Ser Ser Leu Pro 
565 570 575 

CTT GTT ACC AGT GCG ACA ACA AGC CAG GAA ACT GCT TCT TCA TTA CCA 
Leu val Thr Ser Ala Thr Thr Ser Gin Glu Thr Ala Ser Ser Leu Pro 
S80 585 590 

CCT GCT ACC ACT ACA AAA ACG AGC CAA CAA ACC ACT TTG GTT ACC GTG 

Pro Ala Thr Thr Thr Lys Thr Ser Glu Gin Thr Thr Leu Val Thr Val 
595 . 600 60S 

ACA TCC TGC GAG TCT CAT GTG TGC ACT GAA TCC ATC TCC CCT GCG ATT 

Thr Ser Cys Glu Ser Hie Val Cys Thr Glu Ser He ser Pro Ala He. 
610 615 620 

GTT TCC ACA GCT ACT GTT ACT GTT AGC GGC GTC ACA ACA GAG TAT ACC 
Val Ser Thr Ala Thr Val Thr Val Ser Gly Val Thr Thr Glu Tyr Thr 
625 630 635 640 

ACA TGG TGC CCT ATT TCT ACT ACA GAG ACA ACA AAG CAA ACC AAA GGG 
Thr Trp Cys Pro He Ser Thr Thr Glu Thr Thr Lys Gin Thr Lye Gly 
645 650 655 

ACA ACA GAG CAA ACC ACA GAA ACA ACA AAA CAA ACC ACG GTA GTT ACA 
Thr Thr Glu Gin Thr Thr Glu Thr Thr Lys Gin Thr Thr Val Val Thr 
660 66S 670 

ATT TCT TCT TGT GAA TCT GAC -GTA TGC TCT AAG ACT GCT TCT CCA GCC 
He Ser Ser Cys Glu Ser Asp val Cys Ser Lys Thr Ala Ser Pro Ala 
675 680 685 

ATT GTA TCT ACA AGC ACT GCT ACT ATT AAC GGC GTT ACT ACA CAA TAC 

He Val Ser Thr Ser Thr Ala Thr He Asn Gly Val Thr Thr Glu Tyr 

690 695 700 

ACA ACA TGG TCT CCT ATT TCC ACC ACA GAA TCG AGG CAA CAA ACA ACG 
Thr Thr Trp Cys Pro He Ser Thr Thr Glu Ser Arg Gin Gin Thr Thr 
70S * 710 715 720 
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CTA GTT ACT GTT ACT TCC TGC GAA TCT GGT GTG TGT TCC GAA ACT GCT 

Leu Val Thr Val Thr Ser Cys Glu Ser Gly Val Cys Ser Glu Thr Ala 
72S " 730 735 

TCA CCT GCC ATT GTT TCG ACG GCC ACG GCT ACT GTG AAT CAT GTT GTT 
Ser Pro Ala He Val Ser Thr Ala Thr Ala Thr Val Asn Asp Val Val 
740 745 750 

ACG GTC TAT CCT ACA TCG ACG CCA CAG ACT GCG AAT GAA GAG TCT GTC 
Thr Val Tyr Pro Thr Trp Arg Pro Gin Thr Ala Asn Glu Glu Ser Val 
755 760 765 

AGC TCT AAA ATG AAC AGT GCT ACC GGT GAG ACA ACA ACC AAT ACT TTA 
Ser Ser Lys Met Asn Ser Ala Thr Gly Glu Thr Thr Thr Asn Thr Leu 
770 775 780 

GCT GCT GAA ACG ACT ACC AAT ACT GTA GCT GCT GAG ACG ATT ACC AAT 
Ala Ala Glu Thr Thr Thr Asn Thr Val Ala Ala Glu Thr He Thr Aan 
785 790 795 800 

ACT GGA GCT GCT GAG ACG AAA ACA GTA GTC ACC TCT TCG CTT TCA AGA 

Thr Gly Ala Ala Glu Thr Lys Thr Val Val Thr Ser Ser Leu Ser Arg 

60S 810 815 

TCT AAT CAC CCT GAA ACA CAG ACG GCT TCC GCG ACC GAT GTG ATT GCT 
Ser Aon His Ala Glu Thr Gin Thr Ala Ser Ala Thr Asp Val He Gly 
B20 825 830 

CAC AGC AGT AGT GTT GTT TCT GTA TCC GAA ACT GGC AAC ACC AAG AGT 
His Ser Ser Ser Val Val Ser Val Ser Glu Thr Gly Asn Thr Lys Ser 
835 840 845 

CTA ACA AGT TCC GCG TTG AGT ACT ATG TCG CAA CAG CCT CCT AGC ACA 
Leu Thr Ser Ser Gly Leu Ser Thr Met Ser Gin Gin Pro Arg Ser Thr 
850 855 860 

CCA CCA AGC AGC ATG GTA GGA TAT AGT ACA GCT TCT TTA CAA ATT TCA 

Pro Ala Ser Ser Met Val Gly Tyr Ser Thr Ala Ser Leu Glu He Ser 
865 870 875 880 

ACG TAT GCT GGC AGT GGA ACA GCT TAC TGG CCG CTA GTG GTT TAA 
Thr Tyr Ala Gly Ser Ala Thr Ala Tyr Trp Pro Val Val Val 

885 890 895 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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Met Thr Met Pro His Arg Tyr Met Phe Leu Ala Val Phe Thr Leu Leu 
15 10 15 

Ala Leu Thr Ser Val Ala Ser Gly Ala Thr Glu Ala Cys Leu Pro Ala 
5 20 25 30 

Gly Gin Arg Lys Ser Gly Met Asn He Asn Phe Tyr Gin Tyr Ser Leu 
•35 40 45 

Lys Asp Ser Ser Thr Tyr Ser Asn Ala Ala Tyr Met Ala Tyr Gly Tyr 
10 50 55 60 
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Ala Ser Lys Thr Lys Leu Gly Ser Val Gly Gly Gin Thr Asp He Ser 

65 70 75 80 

He Asp Tyr Asn He Pro Cys VaL Ser Ser Ser Gly Thr Phe Pro Cys 
85 90 95 

Pro Gin Glu Asp Ser Tyr Gly Asn Trp Gly Cys Lys Gly Met Gly Ala 
100 105 110 

Cys Ser Asn Ser Gin Gly He Ala Tyr Trp Ser Thr Asp Leu Phe Gly 
115 120 125 

Phe Tyr Thr Thr Pro Thr Asn Val Thr Leu Glu Met Thr Gly Tyr Phe 
130 135 140 

Leu Pro Pro Gin Thr Gly Ser Tyr Thr Phe Lys Phe Ala Thr Val Asp 
145 150 155 160 

Asp Ser Ala lie Leu Ser Val Gly Gly Ala Thr Ala Phe Aan Cya Cya 
165 170 175 

Ala Gin Gin Gin Pro Pro lie Thr Ser Thr Aan Phe Thr lie Asp Gly 
180 185 190 

He Lya Pro Trp Gly Gly Ser Leu Pro Pro Asn He Glu Gly Thr Val 

195 200 205 

Tyr Met Tyr Ala Gly Tyr Tyr Tyr Pro Met Lys Val val Tyr Ser Asn 
210 215 220 

Ala Val Ser Trp Gly Thr Leu Pro He Ser Val Thr Leu Pro Aap Oly 
225 230 235 240 

Thr Thr Val Ser Asp Aap Phe Glu Gly Tyr Val Tyr Ser Phe Asp Aap 
245 250 255 

Aap Leu Ser Cln Ser Aan Cya Thr Val Pro Aap Pro Ser Aan Tyr Ala 
260 265 270 

Val Ser Thr Thr Thr Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr 
275 280 285 

Ser Thr ser Thr Glu Met Thr Thr Val Thr Gly Thr Asn Gly Val Pro 
290 295 300 

Thr Asp Glu Thr Val He Val He Arg Thr Pro Thr Ser Glu Gly Leu 
305 310 315 320 

He Ser Thr Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser 
325 330 335 

Thr Glu Val Thr Thr lie Thr Gly Thr Asn Gly Gin Pro Thr Aap Glu 
340 345 350 

Thr Val He Val He Arg Thr Pro Thr Ser Glu Gly Leu He Ser Thr 
355 360 365 

Thr Thr Glu Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met 

370 375 3SO 

Thr Thr Val Thr Gly Thr Asn Gly Cln Pro Thr Asp Glu Thr Val lie 
385 390 395 400 

Val He Arg Thr Pro Thr Ser Glu Gly Leu Val Thr Thr Thr Thr Glu 
405 410 415 
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Pro Trp Thr Gly Thr Phe Thr Ser Thr Ser Thr Glu Met Ser Thr Val 
420 425 430 

Thr Gly Thr Asn Gly Leu Pro Thr Asp Glu Thr Val He Val Val Lys 
435 440 445 

Thr Pro Thr Thr Ala He Ser Ser Ser Leu Ser Ser Ser Ser Ser Gly 
450 455 460 

Gin He Thr Ser Ser He Thr Ser Ser Arg Pro He He Thr Pro Phe 
465 470 475 480 

Tyr Pro Ser Asn Gly Thr Ser Val He Ser Ser Ser Val He ser Ser 
485 490 495 

Ser Val Thr Ser Ser Leu Phe Thr Ser Ser Pro Val He Ser Ser Ser 
500 505 510 

Val He Ser Ser Ser Thr Thr Thr Ser Thr Ser He Phe Ser Glu Ser 

SIS 520 525 

Ser Lys Ser Ser Val He Pro Thr Ser Ser Ser Thr Ser Gly Ser Ser 
530 535 540 

Glu Ser Glu Thr Ser Ser Ala Gly Ser Val Set Ser Ser ser Phe He 
545 550 555 560 

Ser Ser Glu Ser Ser Lys Ser Pro Thr Tyr Ser Ser Ser Ser Leu Pro 
565 570 575 

Leu Val Thr Ser Ala Thr Thr Ser Gin Glu Thr Ala Ser Ser Leu Pro 
580 585 590 

Pro Ala Thr Thr Thr Lys Thr Ser Glu Gin Thr Thr Leu Val Thr Val 
59S 600 60S 

Thr Ser Cys Glu Ser His Val cys Thr Glu Ser He Ser Pro Ala Ha 
610 615 620 

Val Ser Thr Ala Thr Val Thr Val ser Gly Val Thr Thr Glu Tyr Thr 
625 630 635 640 

Thr Trp Cys Pro He Ser Thr Thr Glu Thr Thr Lys Gin Thr Lys Gly 
645 650 655 

Thr Thr Glu Gin Thr Thr Glu Thr Thr Lyo Gin Thr Thr Val Val Thr 
660 665 670 

He Ser Ser Cys Glu Ser Asp Val cys Ser Lys Thr Ala Ser Pro Ala 
675 680 685 

He Val Ser Thr Ser Thr Ala Thr He Asn Gly Val Thr Thr Glu Tyr 
690 695 700 

Thr Thr Trp Cys Pro He Ser Thr Thr Glu Ser Arg Gin Gin Thr Thr 
705 710 715 720 

Leu Val Thr Val Thr Ser Cys Glu Ser Gly Val Cys Ser Glu Thr Ala 

725 730 735 

Ser Pro Ala He Val Ser Thr Ala Thr Ala Thr Val Asn Asp Val Val 
740 745 750 

Thr Val Tyr Pro Thr Trp Arg Pro Gin Thr Ala Asn Glu Glu Ser Val 
755 760 765 . 
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Ser Ser Lys Met Asn Ser Ala Thr Gly Glu Thr Thr Thr Asn Thr Leu 
770 775 780 

Ala Ala Glu Thr Thr Thr Asn Thr Val Ala Ala Glu Thr lie Thr Asn 
7B5 790 795 800 

Thr Giy Ala Ala Glu Thr Lys Thr Val Val Thr Ser Ser Leu Ser Arg 

80S 810 815 

Ser Asn His Ala Glu Thr Gin Thr Ala Ser Ala Thr Asp Val lie Gly 
820 825 830 

His Ser Ser Ser Val Val Ser Val Ser Glu Thr Gly Asn Thr Lys Ser 
835 840 845 

Leu Thr Ser Ser Gly Leu Ser Thr Met Ser Gin Gin Pro Arg Ser Thr 
650 855 860 

Pro Ala Ser Ser Met Val Gly Tyr Ser Thr Ala Ser Leu Glu lie Ser 
865 870 875 880 

Thr Tyr Ala Gly Ser Ala Thr Ala Tyr Trp Pro Val Val Val 
885 890 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPES DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



GCCCCCAGCC GCACCCTCG 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template non-coding strand 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGAGGGTGCG GCTGGGGGC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: choOlpcr primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AGATCTCAAT TCCCGGCCGC CCCCAGCCGC RCCCTCC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: cho02pcr primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGATCTAACC TTTC AGCTAQ CCTOGATQTC CCACCACATC AT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 
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(B) CLONE: ChoB template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATCATCTCCT CCGACATCCA C 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTGGATGTCG GACGACATGA T 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: mutagenesis primer ChoB 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CGCCGCCACC CCACCCCCCT ATCCACTCCC GATCACGACO 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: ChoB template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CCCCTCGTCA TCCGCAGTCC ATACGGCGCT CCCCTOGCOC CO 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer prtl 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAGATCTATC GATCTTCTTA CCCCGTACA 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: proteinase template non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GACTGTACCG GCTAACAAGA TCGATAGCCC TT 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: proteinase template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GTCGGCGAAA TCCAACCAAA GGCGGCT 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: prt2 primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

CCCAAGCTTC CCCCCGGCOG TTGCTTGGAT TTCGCCGAC 

(2) INFORNATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF1 primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CCGGCCCCCC CGCTCGACQA AAAGAAAGTT TCC 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF receptor template non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GCAAACTTTC TTTTCCTCCA GAGCCCGACT CGC 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF receptor template coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AATGGGCCTA ACATCCCGTC CATCGCCACT 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: EGF2 primer 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCCCRAC C TT AACCCTACOG GACCCGATCT TACCCCCATT 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 177 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: VhC - AGa1 linker with MycT and Hinge 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

GAATTCCACC TCACCGTCTC CTCAGAACAA AAACTCATCT CAGAACAGGA TCTGAATGAA 
CCAAAGATTC CACAACCTCA ACCAAAGCCA CAACCTCAAC CACAACCACA ACCAAAAOCT 
CAACCAAAGC CAGAACCAGA ATCTACTTCC CCAAAGTCTC CAGCTAGCCT TAAGCTT 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: VhC - AGa1 linker with MycT 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GAATTCCACG TCACCGTCTC CTCAGAACAA AAACTCATCT CAGAACAGCA TCTGAATGCT 
AGC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: VhC - AGa1 linker with Hinge 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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GAATTCCACG TCACCCTCTC CTCACAACCA AAGATTCCAC AACCTCAACC AAAGCCACAA 
CCTCAACCAC AACCACAACC AAAACCTCAA CCAAAGCCAG AACCAGAATC TACTTCCCCA 
AAGTCTCCAG CTAGCCTTAA GCTT 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: fragment in pUR4421 coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

AATTTACCGG COGCCCAGGT GAAACTGCTC GAGTAAGTGA CTAACCTCAC CGTCTCCTCA 
GAACAAAAAC TCATCTCAGA AGAGGATCTG AATTAATCAG AATTCATCAA ACGGTCATA 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: fragment in pUR4421 non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AGCTTATCAC CGTTTGATGA ATTCTCATTA ATTCAGATCC TCTTCTGAGA TGAGTTTTTG 
TTCTGAGGAG ACGGTGACCT TAGTCACTTA CTCGAGCAGT TTCACCTGGG CGGCCGCTA 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: Myc tail 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Clu Gin Lyfl Leu lie Ser Glu Glu Asp Leu Asn 
15 10 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: BstEII-Hindlll linker coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CTCACCCTCT CCTCATAATC A 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: BstEII Hindlll linker non-coding strand 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGCTTCATTA TCACCACACG 

(2) INFORMATION FOR SEQ ID NO: 40: 



39 



EP 0 682 710 B1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: (B) CLONE: primer cho03pcr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CGGATCCAAG CTTGAGCCTG GATGTCGGAC GAGATGAT 38 



Claims 

1. A process for carrying out an isolation process by using an immobilised binding protein or functional part thereof 
still capable of binding to a specific compound, wherein the immobilised protein is localized at the exterior of the 
cell wall of a host cell, wherein a medium containing said specific compound is contacted with a host cell under 
conditions whereby a complex between said specific compound and said immobilised binding protein is formed, 
separating said complex from the medium originally containing said specific compound, wherein the host cell is a 
fungus selected from the group consisting of yeasts and moulds, containing an expressible polynucleotide com- 
prising 

(i) a structural gene encoding the binding protein or a functional part thereof still having the specific binding 
capability, said binding protein or said functional part thereof being localized at the cell wall of said fungus, and 

(ii) at least a part of a gene encoding an anchoring protein capable of anchoring in the cell wall of said fungus, 
said part of a gene encoding at least the anchoring part of said anchoring protein, which anchoring part is 
derivable from the C-terminal part of said anchoring protein; 

said polynucleotide being present in a vector or in the chromosome of said fungus. 

2. A process according to claim 1 , wherein the fungus is selected from the group consisting of yeasts belonging to 
the genera Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia and Saccharomyces and moulds belong- 
ing to the genera Aspergillus, Penicillium and Rhizopus can be used. 

3. Process according to claim 1 or 2 wherein the protein capable of anchoring in the cell wall is selected from the 
group consisting of, AGA1 (=a-agglutinin) of S. cerevisiae, FL01 (= floccuiation protein), Major Cell Wall Protein 
of lower eukaryotes, selected from the group consisting of yeasts and fungi. 

4. Process according to any of claims 1-3 wherein said fungus further comprises a sequence encoding a signal 
peptide ensuring secretion of the expression product of said first polynucleotide. 

5. Process according to any of claims 1-4 wherein the binding protein is an antibody, an antibody fragment, a com- 
bination of antibody fragments, a receptor protein, or an inactivated enzyme still capable of binding the correspond- 
ing substrate. 



Patentanspruche 

1 . Verfahren zum Durchfuhren eines Isotierungsverfahrens durch Verwenden eines immobilisierten Bindungsproteins 
Oder eines funktionalen Teils davon, das (der) immer noch zur Bindung einer spezifischen Verbindung fahig ist, 
wobei das immobilisierte Protein an der AuGenseite der Zellwand einer Wirtszelle lokalisiert ist, wobei ein Medium, 
das die spezifische Verbindung enthait, mit einer Wirtszelle unter Bedingungen in Kontakt gebracht wird, unter 
denen ein Komplex zwischen der spezifischen Verbindung und dem immobilisierten Bindungsprotein gebildet wird 
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und der Komplex von dem die spezifische Verbindung ursprunglich enthaltenden Medium getrennt wird, wobei die 
Wirtszelle ein Pilz ist, ausgewahlt aus der Gruppe, die aus Hefen und Schimmelpilzen besteht, enthaltend ein 
exprimierbares Polynucleotid, umfassend 

5 (i) ein Strukturgen, das fur das Bindungsprotein oder einen funktionalen Teil davon, der immer noch die spe- 

zifische Bindungskapazitat aufweist, codiert, wobei das Bindungsprotein oder der funktionale Teil davon auf 
der Zellwand des Pilzes lokalisiert ist, und 

(ii) zumindest einen Teil eines Genes, das fur ein Ankerprotein codiert, das zur Verankerung in der Zellwand 
10 des Pilzes fShig ist, wobei der Teil eines Genes zumindest fur den Verankerungsteil des Ankerproteins codiert, 

wobei der Verankerungsteil vom C-terminalen Teil des Ankerproteins ableitbar ist; 

wobei das Polynucleotid in einem Vektor oder Chromosom des Pilzes vorliegt. 

15 2. Verfahren gemSfi Anspruch 1 , wobei der Pilz aus der Gruppe ausgewahlt ist, die aus Hefen besteht, die zu den 
Genera Candida, Debaryomyces, Nansenula, Kluyveromyces, Pichia und Saccharomyces besteht und daruber 
hinaus Schimmelpilze verwendet werden kdnnen, die zu den Genera Aspergillus, Penicillium und Rhizopus ge- 
horen. 

20 3. Verfahren gemSli Anspruch 1 oder 2, wobei das zum Verankern in der Zellwand fahige Proteine aus der Gruppe 
ausgewahlt ist, die besteht aus: AGA1 (=a-Agglutinin) von S. cerevisiae, FL01 (= Flocculationsprotein), Haupt- 
zellwandprotein von niederen Eukaryoten, ausgewahlt aus der Gruppe, die aus Hefen und Pilzen besteht. 

4. Verfahren gemaft mindestens einem der Anspruche 1 bis 3, wobei der Pilz daruber hinaus eine Sequenz umfasst, 
25 die fur ein Sigalpeptid codiert, wodurch die Sekretion des Expressionsproduktes des ersten Polynucleotids sicher- 

gestellt wird. 

5. Verfahren gemSfi mindestens einem der Anspruche 1 bis 4, wobei das Bindungsprotein ein Antikorper, ein Anti- 
kdrperfragment, eine Kombination von Antikorperfragmenten, ein Rezeptorprotein oder ein inaktiviertes Enzym 

30 ist, das immer noch zur Bindung des entsprechenden Substrats fahig ist. 



Revendications 

35 1. Processus adapte pour realiser un processus d'isolation en utilisant une proteine de liaison immobilisee ou une 
partie fonctionnelle de celle-ci toujours capable de se lier a un compose specifique, dans lequel la proteine immo- 
bilisee est situee a I'exterieur de la paroi cellulaire d'une cellule note, dans lequel un milieu contenant ledit compose 
specifique est mis en contact avec une cellule note dans des conditions ou un complexe entre ledit compose 
specifique et ladite proteine de liaison immobilisee est forme, separant ledit complexe du milieu contenant initia- 

40 lement ledit compose specifique, dans lequel la cellule note est un champignon choisi dans le groupe constitue 

par les levures et les moisissures, contenant un polynucleotide d'expression comprenant 

(i) un gene structurel codant la proteine de liaison ou une partie fonctionnelle de celle-ci ayant toujours la 
capacite de liaison specifique, ladite proteine de liaison ou ladite partie fonctionnelle de celle-ci etant situee 

45 au niveau de la paroi cellulaire dudit champignon, et 

(ii) au moins une partie d'un gene codant une proteine d'ancrage capable de se fixer a la paroi cellulaire dudit 
champignon, ladite partie d'un gene codant au moins la partie de fixation de ladite proteine d'ancrage, laquelle 
partie de fixation est susceptible de deriver de la partie C-terminal de ladite proteine d'ancrage ; 

50 ledit polynucleotide etant present dans un vecteur ou dans le chromosome dudit champignon. 

2. Processus selon la revendication 1 dans lequel il est possible d'utiliser le champignon choisi dans le groupe cons- 
titue par les levures appartenant aux genres Candida, Debaryomyces, Hansenula, Kluyveromyces, Pichia et Sac- 
charomyces et les moisissures appartenant aux genres Aspergillus, Penicillium et Rhizopus. 

55 

3. Processus selon la revendication 1 ou 2 dans lequel la proteine capable de se fixer a la paroi cellulaire est choisie 
dans le groupe constitue par AGA1 (agglutinine A) de S. cerevisiae, FL01 (proteine de floatation), proteine ma- 
jeure de paroi cellulaire des eucaryotes inferieurs, selectionnee dans le groupe constitue par les levures et les 
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champignons. 

Processus selon Tune quelconque des revendications 1 a 3 dans lequel ledit champignon comprend en outre une 
sequence encodant un peptide signal garantissant la secretion du produit depression dudit premier polynucleo- 
tide. 

Processus selon Tune quelconque des revendications 1 a 4 dans lequel la proteine de liaison est un anticorps, un 
fragment d'anticorps, une combinaison de fragments d'anticorps, une proteine receptrice ou une enzyme inactivee 
toujours capable de se lier au substrat correspondant. 
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